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PREFACE 


The aim of this edition, as of the first edition, is to provide an under- 
standing of each of the major learning theories in relation to the experi- 
ments to which that theory gives rise. The years since the first edition 
have seen an enormous expansion of psychological research in learning 
and no dearth of theoretical contributions. The task of providing an intro- 
ductory exposition that is at once clear and critical becomes increasingly 
difficult. 

I have chosen to deal with the major positions that flourished in the 
first half of the twentieth century in much the way they were dealt with 
in the first edition, building the exposition around the psychologists who 
gave currency to the several theories. Thus the chapters on Thorndike, 
Guthrie, Hull, Skinner, Tolman, classical gestalt, Lewin, and functional- 
ism are repeated, though each has been thoroughly rewritten. New chap- 
ters include one on the learning significance of Freud’s psychodynamics, 
one on mathematical models, and two on other current developments. 
The over-all result is a book of one-third more pages. Despite the orienta- 
tion toward historical antecedents of current theory, nearly half the 
literature cited is new to this edition. 

In the effort to keep the book from becoming unduly large, some of 
the material from the first edition has not been incorporated in the re- 
vision. The chapter on Wheeler’s views is omitted with the regret that 
neither he nor any of his followers has revised the position sufficiently to 
keep it currently active. Much of the material from the dropped chapter 
on theories influenced by field conceptions has been treated in other ways, 
but some of it has been lost in the revision. So, too, Hulls 1943 postulates 
have had to give way to allow a critical exposition of his final 1952 system. 
For some purposes, therefore, the present edition does not completely re- 
place the earlier one, and occasional references are made in the new 
edition to non-repeated material in the earlier one. 

The two chapters on current developments recognize that the ad- 
vances in mid-century cannot be confined to the standard positions that 
became defined in the first half of the century. Younger psychologists 
who began as students of one or another of the masters have matured as 
theorists in their own right and can no longer be subordinated as disciples 
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or defenders of older views. In the new chapters I have tried to trace the 
connections with the earlier positions, while recognizing that we are mov- 
ing forward, and debate is no longer confined to old issues. Some mem- 
bers of the new generation of theorists no doubt now deserve chapters in 
their own right; but until the new viewpoints are somewhat better crys- 
tallized, the necessary distinctions would undoubtedly prove confusing 
to the student who is trying for the first time to discover the identifying 
characteristics of the longer established theories. 

It is my hope that this book may assist the student of the psychology 
of learning and his teachers to find their way into the current experi- 
mental and theoretical literature of learning — a literature now increas- 
ingly forbidding without a guide. While my task is expository and not 
polemical, I have not refrained from expressing personal judgments when 
these seemed appropriate to me. I hope, however, that these personal in- 
trusions will be recognized for what they are, and that they remain dis- 
tinguishable from the more objective exposition of the theories here con- 
sidered. In the final chapter on retrospect and prospect I have departed 
from the expository style of the earlier chapters in order to estimate where 
we now stand on the more important isssues and to indicate the paths 
that seem likely to lead to clarification and ultimate agreement. 

Suggestions that have changed the second edition have come from 
many students, colleagues, and critics, most of whom will have to go un- 
named. I cannot refrain from mentioning the patient reading and helpful 
comments of the editors of this series, R. M. Elliott and Kenneth Mac- 
Corquodale. Individual chapters benefited through critical reading by 
Douglas H. Lawrence, Robert R. Sears, Morris I. Stein, Patrick C. Suppes, 
and Thurlow R. Wilson. I am grateful for permission to use as yet un- 
published materials from Neal E. Miller, O. Hobart Mowrer, B. F. Skin- 
ner, and Kenneth W. Spence. The typing was done by Miss Irene Payne 
and the new charts drafted by P. Stanley King. 

The following publishers permitted the reproduction of figures or 
quotations: Appleton-Century-Crofts, Inc., Cambridge University Press. 
Columbia University Press, Cornell University Press, Duke University 
Press, Free Press, Harper and Brothers, Harvard University Press, Ho- 
garth Press, Henry Holt and Company, Houghton Mifflin Company, 
Kegan Paul, Trench, Trubner and Company, Liveright Publishing Corpo- 
ration, McGraw-Hill Book Company, Inc., The Macmillan Company, 
National Society for the Study of Education, Prentice-Hall, Rinehart and 
Company, Ronald Press, Stanford University Press, Teachers College, 
Columbia University, University of Oregon Press, John Wiley and Sons, 
Inc., Yale University Press. The publishers of the following journals and 
monograph series have permitted quotations from these sources: Ameri- 
can Journal of Psychology , American Psychologist , Contributions to Psy - 
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chological Theory , Genetic Psychology Monographs , Journal of Abnormal 
and Social Psychology , Journal of Comparative Psychology , Journal of 
Comparative and Physiological Psychology , Journal of Experimental Psy- 
chology , Journal of General Psychology , Journal of Psychology , Psycho- 
logical Bulletin , Psychological Review , University of California Publica- 
tions in Psychology , University of Iowa Studies in Child Welfare . The 
original source is cited in each case. 


Stanford , California 


E.R.H. 
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Chapter 1 

THE NATURE OF LEARNING 
THEORIES 


rpHE scientific study of learni ng is carried on primarily by psycholo-_ 
— claim to the field was staked In part by masterly 
pioneers such as Ebbinghaus (1885), 1 Bryan and Harter (1897, 1899), 
and Thorndike ( 1898 ) .?*Those who have followed in their footsteps have 
been primarily psychologists. Professional educators have welcomed edu- 
cational psychology as a foundation science upon which to build their 
practices, and s tudie s of learning have gone on concurrently in labora-^ 
tories of general psychology and raboratories of educational psychology, 
withTnferpIay between the pure and applied fields. Under the circiun- 
stances7 it is very natural for psychologists to feelthat the study of 
learning belongs to them. 

In addition to historical reasons, there is another basis on whi ch to/ 
accounTtOr^e^psyeholog^t-s -interest in learningr This isThe centrality 
of learning in the more ge neral system s oF psychological theory. A scien - 
tistTalong with the desire to satisfylhs curiosity about the facts of nature, 
lui.> a predilection for ordermg_ his f acts i n to systems of l a ws and Jheorles . 
He is interested not only in verified facts and relationships, but in neat 
and parsimonious ways of summarizing these lacls. Psychologists with 
a penchant for systems find a theory of learning essential because so 
much of mans diverse behavior is the result of learning. If the nflbl 
"'diversity of behavior is to be understood in accordance with a few princi-l 
pies, it is evident that some of these principles will have to do with the 
way in which learning comes about. / 

Many psychologists make explicit acknowledgment of the centrality 
of learning in their broader systems. Three examples may be cited. 

1. In his definition of behavior as molar rather than as molecular (a 

1 References cited can be found by author and date in the list at the end of the book. 

1 
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THEORIES OF LEARNING 
distinction that lies at the very heart of his system), Tolman 2 lists docility 
( “teachableness” ) as the crowning characteristic of such behavior. All 
molar behavior is docile. Hence learning becomes for Tolman an idem 
tifying character of t ha t which he wishes to i nclu d e as b ehavior. 

*27nGuthrie^makes of learning the mark of mind. As he puts it: 

The ability to learn, that is, to respond differently to a situation because of 
past response to the situation, is what distinguishes those living creatures which 
common sense endows with minds. This is the practical descriptive use of the 
term 4 mind/' 3 

3. Hull, 4 in introducing his theory of the behavior sciences, finds it 
natural to devote the first volume of his series to learning theory. He 
Scarcely distinguishes between a theory of learning and a theory of be- 
havior, so important is learning in his conception of behavior. While his 
books are primarily about learning, their titles reflect the intended gen- 
erality: Principles of behavior , A behavior system . 

Although not all psychologists give this same prominence to learn- 
ing in their theories, the fact that others do makes it imperative for all to 
treat the problems of learning in one way or another. Hence the sys- 
tematic aspects of learning theory have come to be important to all 
psychologists interested in more general theories. 

THE DEFINITION OF LEARNING 

What Learning Includes 

There are many activities which everyone will agree count as il- 
lustrations of learning: acquiring a vocabulary, memorizing a poem, 
learning to operate a typewriter. There are other activities, not quite as 
obviously learned, which easily classify as learned, once you have re- 
flected upon them. Among these are the acquiring of prejudices and 
preferences and other social attitudes and ideals, including the many 
skills involved in the social interplay with other people. Finally there 
are a number of activities whose acquisition is not usually classifiable as 
a gain or improvement because their utility, if such there be, is not 
readily demonstrable. Among these are tics, mannerisms, autistic gestures. 

Such a pointing to illustrations of learning serves very well as a first 
approximation to a definition. It is, in fact, extremely difficult to write 
an entirely satisfactory definition. Although we are tempted^to define 

2 Totocian (1932b), pages 14-16. 

3 Guthrie ( 1952), page 3. 

* Hull (1943). 
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learning as improvement with practice, or as profiting by experience, we 
know very well that some learning is not improvement, and other learn- 
ing is not desirable in its consequences. To describe it alternatively as any 
change with repetition confuses it with growth, fatigue, and other changes 
which may take place with repetition. The following definition may be 
offered provisionally: 

Learning is the process by which an activity originates or is changed through 
reacting to an encountered situation, provided that the characteristics of the 
change in activity cannot be explained on the basis of native response tendencies, 
maturation, or temporary states of the organism (e.g., fatigue, drugs, etc.) . 

The definition is not formally satisfactory because of the many un- 
defined terms in it, but it will do to call attention to the problems involved 
in any definition of learning. The definition m ust disti ngu ish between (1) 
the kinds of changes, and their correlated antecedents, which are in- 
cluded as learning, and (2) the related kinds of changes, and their 
antecedents, which are not classified as learning. We can now go on to 
consider some of the changes that are excluded by our provisional defini- 
tion. 


Native Respon se Ten dencies verms Learning 

The older^atalQgs.-Qfii^ un- 

learned activities the re fle xes (such as pupillary constriction lo^light) , 
the bropisms (such as a moth’s dashing into a flame),, and the -.instincts. 
(such as a birds nest-building). We may continue to ack now ledge s uch 
activities as. species chara cteris tics. Ajtgr a period during which it was 
ratKef *emotionally tabooed, the term ^instinct’’ is again part. of the_.psyr 
chologist’s vocabulary, referring to complex unlearned activity charac- 
teristicdf aspeci”es?73»~ 

The problem of instinct versus learning is not solved by attempting 
to classify some behavior as instinctive in its entirety and other behavior 
as learned in its entirety. The in-between nature of behavior wit& large 
instinctive components is well illustrated by the experiments on im- 
printing . 6 A young duckling, for example, is prepared instinctively to 
accept a certain range of mother-figures, characterized by size, move- 
ment, and vocalization. Once such a mother-figure has been accepted 
and followed about, only this particular mother now satisfies the instinc- 
tive demand. The selected mother (who may be Professor Lorenz crawl- 
ing on hands and knees ) has become imprinted, and is the only 4 mother' 

5 Tinbergen ( 1951 ). 

•E.g., Lorenz (1952), Ramsay (1951). 
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the duckling will now follow. Imprinting is then a form of learning, bul 
a form very closely allied to the instinctive propensities of a particulai 
kind of organism of a particular age. (We lack at present evidence foi 
imprinting except in fowls and other birds. ) The problem of distinguish- 
ing between Jhe^instinctive and learned com ponent?^ 
illustrated by imprinting is an exp er im en tal_pr^ for its 

clarification- upon the ingenuity of the experimenters^ 
propriate control exper iments. 

Maturation versus L earning 

Growth JsJ^rmng^^ a n^ffie^ If 

a behavior sequence matures through regular stages irrespective of inter- 
vening practice, the behavior is said to develop through maturation and 
not through learning. If training procedures do not speed up or modify 
the behavior, such procedures are not causally important and the changes 
do not classify as learning.^Relatively pure cases like the swimming of 
tadpoles and the flying of birds can be attributed primarily to matura- 
tion. Many activities are not as clear-cut, but develqg^tfirqugh a^compjex 
interplay" of maturafidn and leaming^ jLconvenient illustration is the de^ 
velopment of language in the child. The child does not learn to talk 
until old enough, but the language which he learns is that which he hears. 
In such cases it is an experimental problem to isolate the effects of 
maturation and of learning. The ambiguity in such cases is one of fact, 
not of definition. 

Fatigue versus L^arnin^ 

When activities are repeated in rapid succession, there is jpftgp a 
loss in efficiency commonly attributed to fatigue. S uch changes in per- 
formance are called work decrements in the experimental laboratory* 
The units of a work curve are like those of a practice curve: performance 
plotted against trials or repetitions. Hence the experimental arrangements 
in obtaining a work curve are essentially those of a learning procedure 
and, at first sight, it appears to be a form of question-begging to define 
the processes involved by the results obtained. It would be question - 
begging, however only^ 

cEangeln performance^ Actually both learning and fatigue are inferences 
IronTtHe performances, and it is permissible 

the~“obfambd perfdfmahceT'require or suggest. Fatigue curves tend tp 
show decreasing proficiency with repetition and recovery with restsL 
Learning curves ordinarily show gains with repetitions and forgetting 
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over rests., These typical differences between learning effects, and fatigue 
effects avi evident enough, but the inferences from performance are made 
on somewhat more complex evidence. It is because of the complexity of 
these inferences that it is difficult to state a concise definition of learning 
which will conserve the learning inferences from performance while 
eliminating the fatigue inferences. The problem is logically the same as 
distinguishing changes due to maturation and to learning. But again the 
ambiguity is one of fact, not of definition. 

Learning always must remain an inference from performance, and 
only confusion results if performance and learning are identified. A clear 
illustration is provided by performance under the influence of drugs or 
intoxicants. The fact that learned behavior fails when the organism is in 
such a state does not mean that forgetting has occurred. When the normal 
state has been restored, the performance may return to normal levels 
although there has been no intervening training. 

Learning and the Nervous System 

Some definitions of learning avoid the problem of performance by 
defining learning as a change in the central nervous sysfcenrr'So long as 
this change in the nervous system persists, temporary dnanges in state, 
such as those in fatigue and intoxication, affect performance but not 
learning. This definition asserts that learning is an inference, but it goes 
on to make a particular sort of inference about the role of the nervous 
system in learning. In view of the lack of knowledge of what actually 
does take place inside the organism when learning occurs, it is preferable 
not to include hypothetical neural processes in the definition of learning. 
We know that learning takes place. We should therefore be able to define 
what we are talking about without reference to any speculation what- 
ever. This position does not deny that what we are calling learning may 
be a function of nervous tissue. It asserts only that it is not necessary to 
know anything about the neural correlates of learning in order to know 
that learning occurs. 

Learning , Problem-Solving, and Reasoning 

After you have learned, there are many things which you are able 
to do. If you can add and subtract, you can solve many novel problems 
without learning anything new. Where the solution of problems is rela- 
tively mechanical (as in addition and subtraction), the problem may 
be thought of as merely the exercise or utilization of a learned bit of be- 
havior. When, however, there is greater novelty, more putting of things 
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into relationship, as in reasoning or inventiveness, the process is interest- 
ing in its own light, and is not to be described simply as the running 
off of old habits. 

The question has been raised, especially by Maier , 7 as to the appro- 
priateness of including processes like reasoning within the same classifi- 
cation as other kinds of learning. My preference is for including them. 
Leaving them in does not prejudge their explanation. There may be 
several kinds of learning from the simpler to the more complex, not all 
following the same principles. If so, we have no assurance that the only 
sharp break comes when “reasoning” appears. Leaving the doubtful 
processes in simply asserts that a complete theory of learning must 
have something to say about reasoning, creative imagination, and in- 
ventiveness, in addition to what may be said about memorizing and 
retaining or about the acquisition of skill. 

Definition Not a Major Source of Disagreement Between 

Theories 



While it is extremely difficult to formulate a satisfactory definition 
oif learning so as to include all the activities and processes which we wist 
to include and eliminate all those which we wish to exclude, the diffi- 
culty does not prove to be embarrassing because it is not a source of 
controversy as between theories. The controversy is over fact and in- 
terpretation, not over definition .yThere are occasional confusions over 
definition, but such confusions 'may usually be resolved by resort to 
pointing, to denotation. For the most part it is satisfactory to continue to 
mean by learning that which conforms to the usual socially accepted 
meaning that is part of our common heritage. Where distinctions have 
to be made with greater precision, they can be made through carefully 
specified types of inference from experiments. 


SOME TYPICAL PROBLEMS CONFRONTING 
TLEARNING THEORIES^ 

The preferences of the theorist often lead him to concentrate upon 
one kind of learning situation to the neglect of the others. His theory is 
then appropriate to this situation, but becomes somewhat strained in rela- 
tion to other problems of learning. A comprehensive learning theory 
ought-J:o_answer the questions which ^ 
might^sk abouFtfe^sorts"“of learning which are met in everydayTife. 

7 Maier (1931). 
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A few such questions will be listed here, and then used later in apprais- 
ing the theories which different writers present, 

1. What are the limits of ;Jggn?in^ffJ.ere is raised the questio n of 

the cap acity to l earn, of in dividual differences i_ampM 
sam ejspecies and of unlike species. There are questions not only of per- 
sistent differences in capacity, but of change in capacity wi th a ge. Who 
can learn what? Arc the limits sct.at^Bi^TTDcTpeopre^ get more or less 
alike wiflrpractice? These are the sorts of questions which it is natural 
to raise. / 

2. What is the role^£jpmctwe~-in„ leaming?The old adage that 
practice makes perfect has considerable racial wisdom behind it. Surely 
one learns to roller skate or to play the piano only by engaging in the 
activity. But what do we know about practice in detail? Does improve- 
ment depend directly y>n the amount of repetition? If not, whaFare its- 
conditi ons? What are the mosnivorable circumstances of practice? 
Can repetitive drill be harmful as well as he lpf ul to the learner? 

3. How important arejdjives_ and- irwentims^zewards and punish- 


3 * 


merits ? "Everybody Jmows i n a g eneral way that learning can be con- 
trolle d by rewards and punishments, and that it is easier to learn some- 
thing which is interesting than something which is dull. But are the con 
| sequences of rewards and punishments equal and opposite? Is there aj 
* difference between intrinsic and extrinsic motives in their effect upon 
learning? How do goals and purposes affect the process? 

4. What is the place of understanding and insight? Some things are 
learnedhmbrTreadily if we know what we are about. We are better off 
as travelers if we can understand a timetable or a road map. We are 
helpless with differential equations unless we understand the symbols 
and the rules for their manipulation. But we can form vowels satisfac- 
torily without knowing how we place our tongues, and we can read 
without being aware of our eye movements. Some things we appear to j 
acquire blindly and automatically; some things we struggle hard td 
understand and can finally master only as we understand them. Is learn- 
ing in one case different from what it is in the other? 

5. Does learning one thing help you learn something else? This is 
the pr oblem of formal discipline, as it usecfto be caHeBTor^j ff tra nsfer 
of training, to use a more familiar contemporary designation. Some 

Transfer ”^f "training must occur or there would be no use in developing 
a foundation for later learning. Nobody denies that it is easier to build 
a vocabulary in a language after you have a start in it, or that higher 
mathematics profits from mastery of basic concepts. The question is really 
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one of how much transfer takes place, under what conditions, and what 
its nature is. 

6. What happens whe n we remember cind^ when tve forge t? The 
ordinary facts of memory are myslenous enough, but in addition to 
familiar remembering and forgetting, our memories may play peculiar 
tricks on us. Some things we wish to remember are forgotten; some 
things we would be willing to forget continue to plague us. In cases 
of amnesia there are often gaps in memory, with earlier and later events 
remembered. Then there are the distortions of memory, in which we re- 
member what did not happen, as is so strikingly demonstrated in testi- 
mony experiments. What is taking place? What control have we over 
processes involved? 

These six questions will serve as useful ones to ask of each of the 
major theories. They suffice to illustrate the kinds of questions which 
give rise to theories of learning. 


ISSUES ON WHICH LEARNING THE 



■HEirthe pfe^<tog^ctionwe asked certain common-sense questions 
about learning, on the assumption that a good learning theory should 
have something to say about each of them. Such questions can be raised 
before we know anything about actual learning theories. Now we wish 
to turn, however, to certain issues that have arisen in the formulation of 
actual theories. By alerting us in this way to what is to follow, we are 
better prepared for some of the differences in flavor that we shall meet 
as we review one theory after another. 

Learning theories fall into two ma jor families; stimulus-response 
leofies afid cu wnMoe^ieo nes. but not all theories belong to these two 
family theories include such diverge members 

as th e the ories oTTfaOrnllikerGu^^ IfulL, ^e cognitive.., 

theories Jn 3^ leasi ftKose ofToIn^ gestalt psycholo- 

gists, an d JLe win . Not completely and clearly classi^able in^ these ferms w 
are t he theories of ftmMonalism, psychodyhamics, and the pro ba bilistic 
theoriesqf the model builders. The lhiesoF cleavage between the two 
'“families of theories are nol the only cleavages within learning theories; 
there are other specific issues upon which theories within one family may 
differ. 
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General Issues Producing a Cleavage Between Stimulus- 

Response and Cognitive Theories 

The cleavages between the theorists of opposing camps are diffi- 
cult to understand because many of the distinctions which at first seem 
to contrast sharply later are found to be blurred. All reputabl e theorists 
accept a common logic of experimentation, so that dis agr e ements^p ver 
exper imentally^obtal.nM„ia cts a re readijv arbitr^cL In th e end, al l the 
theoris ts accept a common -body of- demonstrated- relationships, at the 
fact ua l or descrip tive level; any theorist who denied- an established fact, 
a reproducible experimental finding, would lose status among his scien- 
tific colleagues, and his theories would no longer command respect.. The 
firsF rule that we must be prepared to accept, as we judge the relative 
merits of different theories is this: All the theo r ists accept all of theJacis. 
Some experimental findings are doubted when they are first announced 
and the status of findings as fact may for a long time be doubted; but 
once the status as fact is established, all accept the fact as true. Hence 
the differences between two th e orists ar e primarily differences in inter- 
pretation. Both theories may fit the facts reasonably well, but the pro- 
ponenFbf each theory believes his view to be the more fruitful. We shall 
be better prepared later on to discuss the ways in which theories get 
Validated or modified after we are acquainted with them in more detail. 
For the present, we must be prepared to accept the historical truth that 
opposing theories have great survival value, and that an appeal to the 
facts as a way of choosing between theories is a very complex process, 
not nearly as decisive in practice as we might expect it to be. 

We may begin by examinin g three k inds of preferences on which 
stimulus-responsej&eonsts tend to differ from cognitive theorists. 

1 . “Peripheral” versus “ Central 9 Intermediaries 

Ever since Watson promulgated the theory that thinking was merely 
the carrying out of subvocal speech movements, stimulus-response 
theorists have preferred to find response or movement intermediaries to 
sexW as integrators of behavior sequences. Such movement-produced 
intermediaries can be classified as "peripheral” mechanisms, as contrasted 
with "centra 1” (ideational) intermediaries. The stimulus-response theo- 
rist tends to believe that some sort of chained muscular responses, linked 
perhaps by fractional anticipatory goal responses, serve to keep a . rat 
running to a distant food box. The cognitive theorist, on the other hand, 
more freely infers central brain processes, such as memories or expec- 



10 THEORIES OF LEARNING 

tations, as integrators of goal-seeking behavior. The differences in pref- 
erence survive in this case because both kinds of theorists depend upon 
inferences from observed behavior, and the inferences are not directly 
verified in either case. It is potentially easier to verify tongue movements 
in thinking than it is to discover a revived memory trace in the brain, but 
in fact such verification is not offered with the precision necessary to com- 
pel belief in the theory. Under the circumstances, the choice between the 
peripheral and the central explanation is not forced, and favoring one or 
the other position depends upon more general systematic preferences. 

2. Acquisition of Habits versus Acquisition of Cognitive 
Structures 

The stimulus-response theorist and the cognitive theorist come up 
with different answers to the question, What is learned? The answer 
of the former is "habits”; the answer of the latter is “cognitive structures.” 
The first answer appeals to common sense: we all know thaj; we develop 
smooth-running skills by practicing them; what we learn is responses. 
But the second answer also appeals to common sense: if we locate a candy 
store from one starting point, we can find it from another because we 
“know where it is”; what we learn is facts . A smooth-running skill illus- 
trates a learned habit; knowing alternate routes illustrates cognitive struc- 
ture. If all habits were highly mechanical and stereotyped, variable non^ 
habitual behavior would force us to admit cognitive structures as part, at 
least, of what is learned. But the stimulus-response psychologist is satis- 
fied that he can deduce from the laws of habit formation the behavior 
that the cognitive theorist believes supports his interpretation. Hence we 
cannot choose between the theories by coming up with “decisive” illustra- 
tions of what we learn, for both groups of theorists will offer explanations 
of all our examples. The competing theories would not have survived 
thus far had they been unable to offer such explanations. 

3 . Trial and Error verms Insight in Problem-Solving 

When confronted with a novel problem, how does the learner reach 
solution? The stimulus-response psychologist finds the learner assem- 
bling his habits from the past appropriate to the new problem, respond- 
ing either according to the elements that the new problem has in common 
with familiar ones, or according to aspects of the new situation which 
are similar to situations met before. If these do not lead t& solution, the 
learner resorts to trial and error, bringing out of his behavior reper- 
tory one response after another until the problem is solved. The QOg- 
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nitive psychologist agrees with much of this description of what the 
learner does, but he adds interpretations not offered by the stimulus- 
response psychologist. He points out, for example, that granting all the 
requisite experience with the parts of a problem, there is no guarantee 
that the learner will be able to bring these past experiences to bear upon 
the solution. He may be able to solve the problem if it is presented in one 
form fnd not solve it if it is presented in another form, even though 
both forms require the same past experiences for their solution. Ac- 
cording to the cognitive theorist, the preferred method of presentation 
permits a perceptual Structuring leading to “insight,” that is, to the 
understanding of the essential relationships involved. The stimulus-, 
respond psychologist tends, by preference, to look to the past history 
of the learner for the sources of solution, while the cognitive psychologist, 
by preference, looks to the contemporary structuring of the problem. 
His preference for the past does not require the stimulus-response psy- 
chologist to ignore the present structuring of the problem, nor does 
his preference for the present require the cognitive psychologist to ig- 
nore the past. One must not assume because there is a difference in pref- 
erence that either theorist is blind to the totality of the learning situa- 
tion. The facts of the insight experiment are accepted by both theorists, 
as are the facts of skill learning. We may remind ourselves again that 
feo single experiment will demolish either the interpretation according 
To trial and error or the interpretation according to insight. 

These three issues-^eripheral versus central intermediaries, ac- 
quisition of habits versus acquisition of cognitive structures, and trial and 
error versus insight in problem-solving — give something of the flavor 
of the ^differences between these t\fb major families of theories. 

Specific Issues Not Confined to the Major Families 

Some issues lie outside the conflict between the stimulus-response 
and the cognitive theories. Thus ^wo stimulus-response psychologists 
may differ as to the role of reinforcement in learning, and two cognitive 
theorists may differ as to the necessity for a physiological explanation of 
learning. Three of these issues will suffice to alert us to the many prob- 
lems that learning theorists face. 

1 . Contiguity versus Reinforcement 

The oldest law of association is that ideas experienced together 
tend to become associated. This has come down in one form or another 
to the present day as the principle of association by contiguity, although 
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it is now more fashionable to describe the association as between stimuli 
and responses rather than as between ideas. Several of our contemporary 
theorists accept the principle of contiguous association, notably Guthrie 
(a stimulus-response psychologist) and Tolman (a cognitive psycholo- 
gist). Other theorists insist that learning does not take place through 
contiguity alone, unless there is some sort of reinforcement, some equiv- 
alent of reward or punishment. 

2 . One or More Kinds of Learning? 

The contiguity-reinforcement dilemma may be resolved by accept- 
ing both, thus defining two varieties of learning. This solution has ap- 
pealed to theorists such as Thorndike and Skinner and Mov^r. But 
these two varieties are not the only possibilities. Perhaps by using the 
common name "learning” to cover the acquisition of motor skills, the 
memorization of a poem, the solving of a geometrical puzzle, and the 
understanding of a period in history, we are deceiving ourselves by look- 
ing for common laws explanatory of processes that have little in common. 

Hence the theorist has to choose between a single-factor theory and 
a multi-factor one. Tolman at one time pointed to the possibility of seven 
kinds of learning. 

3 . Intervening Variables versus Hypothetical Constructs 8 

We have already considered a contrast between two types of inter- 
mediary, the peripheral and the central types. But as theories become 
more refined, additional problems arise over the way in which inferred 
intermediaries should be specified. One kind of intermediary found in 
theories is a mathematical constant that reappears in various contexts, 
such as the acceleration of a free-falling body (g) that appears in equa- 
tions describing the movement of a pendulum, the path of a projectile, 
or the way in which balls roll down inclined planes. Such an integrating 
intervening variable need have no properties other than those expressed 
in its units of measurement, that is, it need have no independent exist- 
ence, apart from the functional relationships it has in its systematic con- 
text. This kind of integrating intermediary, without surplus meanings, is 
called an intervening variable. By contrast, some kinds of intermediaries 
are concrete, tangible, palpable, with properties of their own. Suppose, 
for example, we describe the behavior that results when a cat is con- 
fronted with a barking dog. The cat arches its back, hisses, its hair stands 
on end, and numerous changes take place within its digestive and circu- 
latory system. Many of the internal changes can be explained by the 

8 MacCorquodale and Meehl (1946), Marx (1951b), Ginsberg (1954), Maze 
(1954). 
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use of a demonstrable intermediary, adrenin, the hormone of the 
adrenal glands. Suppose that before adrenin was isolated a theorist had 
inferred that some substance in the blood stream was causing the internal 
changes. This would have been a hypothetical construct at this stage, 
an inferred intermediary with palpable qualities. The discovery of 
adrenin would have then confirmed the hypothesis that such a substance 
in the blood stream was, in fact, causing many of the changes. Adrenin, 
as a substance, has other properties than those inferred from bodily 
changes in emotion. In this it differs from a mere intervening variable, 
which has no further properties beyond its systematic ones. 

Those who hold with intervening variables in their learning theories 
are free to choose such variables as they wish, provided they serve their 
systematic purposes of producing a more coherent and parsimonious 
theory than can be produced without them. Those who prefer hypo- 
thetical constructs must seek either demonstrable movements or secre- 
tions (if they are peripheralists ) , or some physiological brain processes 
(if they are centralists). Again, the issue over intervening variables or 
hypothetical constructs is not confined to one or the other of the major 
theoretical families. 

One extreme position is that we can do away with intermediaries 
entirely (Skinner). Thus, on this issue as on the others, we have nearly 
all possible views represented. 

This brief introduction to three contrasts between stimulus-response 
theories and cognitive theories, and three issues that are not confined 
to the two major families, should make it clear that what seem to be 
diametrically opposed points of view may turn out to be based on 
differences in preference, each being possible of persuasive statement, 
and to a point justifiable. The opposed cases are each made by intelligent 
men of good will. We shall have to wait until later to consider how a 
more unified outlook may eventually be achieved. 

THE PLAN OF THIS BOOK 

The student of learning, conscientiously trying to understand learn- 
ing phenomena and the laws regulating them, is likely to despair of find- 
ing a secure position if opposing points of view are presented as equally 
plausible, so that the choice between them is made arbitrary. He may 
fall into a vapid eclecticism, with the general formula, “There’s much to 
be said on all sides/’ 

This is not" a necessary outcome of a serious attempt to understand 
opposing points of view. Science ought to be systematic, not eclectic, 
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but a premature systematic position is likely to be dogmatic and bigoted 
just as an enduring eclecticism is likely to be superficial and opportunistic. 
It is possible to have systematization of knowledge as the goal without 
permitting the desire for system to blind the seeker after it to the truths 
unearthed by those with views unlike his own. 

Throughout the chapters that follow, in presenting one after the other 
a variety of systematic positions, with illustrative experiments testing 
their assertions, the effort is made to show that there is something to be 
learned from each of them. Each has discovered phenomena which move 
us forward in our knowledge about learning. At the same time, no one 
has succeeded in providing a system invulnerable to criticism. The con- 
struction of a fully satisfactory theory of learning is likely to remain for 
a long time an uncompleted task. 
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Chapter 2 

THORNDIKE’S CONNECTIONISM 


F OR nearly half a century one learning theory dominated all others 
in America, despite numerous attacks upon it and the rise. j)f its 
many' rivals. It is the theory of Edward L. Thorndike (1874-1949), first 
announced in his Animal intelligence (1898). Its pre-eminence has been 
aptly assessed by Tolman: 

The psychology of animal learning — not to mention that of child learning — 
has been and still is primarily a matter of agreeing or disagreeing with Thorndike, 
or trying in minor ways to improve upon him. Gestalt psychologists, conditioned- 
reflex psychologists, sign-gestalt psychologists — all of us here in America seem 
to have taken Thorndike, overtly or covertly, as our starting point. 1 

The, basis of learning accepted by Thorndike in his earliest writings 
was association between sense impressions and impulses to action. Such 
an association came to be known as a “bond” or a “connection.” Because 
it is these bonds or connections which become strengthened or weak- 
ened in the making and breaking of habits, Thorndike’s system has some- 
times been called a “bond” psychology or simply “connectionism ” As 
such it is the original stimulus-response or S-R psychology of learn- 
ing* 


CONNECTIONISM BEFORE 1930 

There were few" changes in Thorndike’s theory between 1898 and 
1930. During these years Thorndike devoted himself largely to applica- 
tions of his established theory to problems of educational and social im- 
portance. Because of the stability of the concepts during these years, it 
is possible to select any one of Thorndike’s many publications to serve 
as a guide to his theory. The major work, from which most of the quo- 
tations in what follows have been taken, is the three-volume Educational 


1 Tolman (1938a), page 11. 
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psychology (1913-1914), which represents the system at the height of 
its popularity. 

The most characteristic form of learning of both lower animals and 
man is identified by Thorndike as trial-and-error learning, or, as he 
preferred to call it later, learning by selecting and connectingTjThe learner 
is confronted by a problem situation in which he has to reach a goal 
such as escape from a problem-box or attainment of food. He does this by 
selecting the appropriate response from a number of possible responses. 
A trial is defined by the length of time (or number of errors) in- 
volved in a single reaching of the goalJ Thorndike’s earliest experiments 
were of this kind, done chiefly with c&ts, although some experiments with 
dogs, fish; and monkeys were included. 2 ^he typical experiment is that of 
a hungry cat confined in a box with a concealed mechanism operated by 
a latch. If the cat correctly manipulates the latch, the door opens, and the 
cat gains access to the food outside. The first trials are characterized by a 
great amount of clawing, biting, and dashing about before the latch is 
moved. The score, as measured in elapse jr time, is high. On succeeding 
trials the time scores get lower, but slowly and irregularly. It is this 
gradualness which suggests that the cat does not really "catch on” to the 
manner of escape, but learns it instead by the stamping in of correct 
responses and the stamping out of incorrect ones. 

Experiments like this have become so commonplace that the im- 
portance of their introduction by Thorndike is easily overlooked. By 
contrast with the other laboratory arrangements within which learning 
was studied, the problem-box brought to the fore the problems of motiva- 
tion,.^ of rewards and of punishments. The typical laboratory experi- 
ments on learning before Thorndike were either the experiments intro- 
duced by Ebbinghaus on the memorization and recall of verbal mate- 
rial? or the experiments on acquistion of skill exemplified by the Bryan 
and Harter studies of learning telegraphy. In both of these, motivation 
remains in the background as one of the contextual features, along with 
learning capacity and other factors not entering as manipulated variables. 
In his "law of effect” Thorndike brought motivation into the fore- 
ground. Trials were defined not by a repetition of a list (Ebbinghaus) 
or by so many minutes of practice (Bryan and Harter) but by the per- 
formance prior tojsuccessful (or unsuccessful) goal attainment, 

Thorndike saw that in his law of effect he had added an important 
supplement to the familiar law of habit formation through repetition: 

• 2 Thorndike' (1898) (1911). 
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But practice without zeal — with equal comfort at success and failure — 
does^not make perfect, and the nervous system grows away from the modes in 
which it is exercised with resulting discomfort. When the law of effect is omitted 
— when habit-formation is reduced to the supposed effect of mere repetition — 
two results are almost certain. By the resulting theory, little in human behavior 
can be explained by the law of habit; and by the resulting practice, unproduc- 
tive or extremely wasteful forms of drill are encouraged. 3 

The interest in rewards and punishments which grew out of his 
experiments with animals continued naturally enough as he turned his 
attention to learning as it occurs in schools. There the arguments over 
punishment, promotion, school marks, and other incentive devices were 
rife, even though academic psychologists had not yet awakened to the 
centrality of motivational concepts. 

Thorndike's experiments on animals had a very profound influence 
upon his thinking about human learning. He became convinced, con- 
trary to the then popular beliefs, that animal behavior was little medi- 
ated by ideas. ^Responses were said to be made directly to the situation 
as sensed. While he did not go so far as to deny ideation among animals, 
he was convinced that the great bulk of their learning could be explained 
by the direct binding of acts to situations, unmediated by ideas. A com- 
parison of the learning curves of human subjects with those of ani- 
mals led him to believe that the same essentially mechanical phenomena 
disclosed by animal learning are the fundamentals of human learning 
also. Although always aware of the greater subdety and range of human 
learning, he showed a strong preference for understanding more complex 
learning in terms of the simpler, and for identifying the simpler forms of 
human learning with that of animals. 

Both theory and practice need emphatic and frequent reminders that man's 
learning is fundamentally the action of the laws of readiness, exercise, and effect. 
He is first of all an associative mechanism working to avoid what disturbs the 
life-processes of the neurones. If we begin by fabricating imaginary powers and 
faculties, or if we avoid thought by loose and empty terms, or if we stay lost in 
wonder at the extraordinary versatility and inventiveness of the higher forms of 
learning, we shall never understand mans progress or control his education. 4 

The systematic position is best understood through the three laws 
to which he refers: readiness, exercise, and effect. It is in accordance 
with these laws that animal and human learning takes place. 

8 Thorndike (1913b), page 22. 

4 Thorndike (1913b), page 23. 
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The Law of Readiness 

The law of readiness is an accessory principle which describes a 
physiological substratum for the law of effect. It states the circumstances 
under which a learner tends to be satisfied or annoyed, to welcome or to 
reject. There are three such circumstances: 5 

1. When a conduction unit is ready to conduct, conduction by it is satis- 
fying, nothing being done to alter its action. 

2. For a conduction unit ready to conduct not to conduct is annoying, and 
provokes whatever response nature provides in connection with that particular 
annoying lack. 

3. When a conduction unit unready for conduction is forced to conduct, 
conduction by it is annoying. 

Although “conduction units” were referred to in his earlier writings 
as though he were talking about actual neurones, Thorndike did not, in 
fact, pay much attention to neuroanatomical details. He talked about 
neurones to be clear that he was talking about direct impulses to action, 
and not about “consciousness” or “ideas.” It must be remembered that 
Ills system antedated behaviorism, even though its emphasis was defi- 
nitely toward an objective account of behavior. The physiological lan- 
guage was the most available vocabulary for the objectivist prior to the 
rise of behaviorism. Actually Thorndikes “conduction units” have no 
/precise physiological meaning. It would be difficult, for example, to 
' understand how a physiological unit unready to conduct could be made 
^ to conduct. 

If for “conduction unit” a term such as “action tendency” is sub- 
stituted, the psychological meaning of Thorndike $ law of readiness be- 
comes clearer. When an action tendency is aroused through preparatory 
adjustments, sets, attitudes, and the like, fulfillment of the tendency in 
action is satisfying, non-fulfillment is annoying. Readiness thus means 
a pr^^ation^f or. j^etion J Thorndike uses the illustration of an animal 
running after its prey, getting ready all the while for jumping upon it and 
seizing it. He describes a child seeing an attractive object at a distance, 
getting ready to approach it, seize it, and manipulate it. He says that it 
is the neurones which prepare prophetically for later actions in the se- - 
quence. This somewhat objectionable manner of describing what goes on 
need not detract from the reality of the psychological observation that 
satisfaction and frustration depend upon what the organism is pre- 
pared to do. 

There is another kind of readiness familiar to educators. This is 
5 The wording follows Thorndike ( 1913a), page 128. 



THORNDIKE'S CONNECTIONISM 19 

illustrated by the use of “reading readiness" to refer to the child’s reach- 
ing a maturity level appropriate to the beginning of reading. Thorn- 
dike did not use his law of readiness in this way, and it would* be his- 
torically inaccurate to construe his law of readiness as an anticipation 
of maturational readiness. There is, of course, a logical relationship be- 
tween the two kinds of readiness, because interests and motives mature 
along with capacities. But Thorndike's readiness was a law of prepara- 
tory adjustment, not a law about growth. 

The Earlier Law of Exercise 

The law of exercise refers to the strengthening of connections with 
practice (Law of Use) and to the weakening of connections or forget- 
ting when practice is discontinued (Law of Disuse). Strengthening is de- 
fined by the increase in probability that the response will be made when 
the situation recurs. This probability may be either a greater probability 
of occurrence if the situation is repeated immediately, or an equal 
probability persisting longer in time. That is, a stronger connection is 
in a favored competitive position relative to other habits, either at the 
time of its strengthening or if tested after there has been opportunity 
for forgetting. 

The definition of strength by probability of occurrence 6 has a very 
contemporary ring. It is acceptable in itself to those who might go on 
to reject the structural basis assigned by Thorndike to changes in strength 
of connections. Although changes were said to occur in neurones and 
synapses, even in his earlier writings the precise properties attributed to 
neurones were cautiously stated and not actually essential to the theory. 

The kinds of phenomena falling under the law of exercise are chiefly 
those of repetitive habits, as in rote memorizing or the acquiring of mus- 
cular skills. Learning curves in which performance is plotted against 
trials represent the quantification of the law of use; forgetting curves 
give quantitative details for the law of disuse. During the period under 
discussion, Thorndike accepted uncritically the prevailing principle of 
learning by doing, even though he had criticized the use of the principle 
independent of the law of effect. He later altered his position and greatly 
reduced the emphasis upon the law of exercise. 

The Earlier Law of Effect 

The law of effect refers to the strengthening or weakening of a 
connection as a result of its consequences. When a modifiable connection 

* Thorndike ( 1913b ), page 
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is made and is accompanied by or followed by a satisfying state of 
affairs, the strength of the connection is increased; if the connection is 
made and followed by an annoying state of affairs, its strength is de- 
creased. 

Two chief objections have been made to the law of effect by its 
critics. At the height of behaviorism it was objected that satisfaction and 
annoyance were subjective terms, inappropriate for use in describing ani- 
mal behavior. But Thorndike was in reality ahead of his critics, for he 
had early stated what he meant by such states of affairs in what would 
today be called operational terms: 

By a satisfying state of affairs is meant one which the animal does nothing 
to avoid, often doing things which maintain or renew it. By an annoying state 
of affairs is meant one which the animal does nothing to preserve, often doing 
things which put an end to it. 7 

These definitions are not circular, so far as the law of effect is con- 
cerned. That is, the states of affairs characterized as satisfying and as 
annoying are specified independently of their influence upon modifiable 
connections. The law of effect then states what may be expected to 
happen to preceding modifiable connections which are followed by such 
specified states. The objection that Thorndike was lacking in objectivity 
in the statement of the law of effect is not a valid one. 

The second objection was that the backward effect of a state of 
affairs on something now past in time is not conceivable. The past is 
gone, effects can be felt only in the present, or perhaps revealed in 
the future. The criticism, like the first, is a faulty one. The effect is 
revealed in the probability of occurrence of the response when the situa- 
tion next occurs; whether or not such an effect occurs is a matter of ob- 
servation and experiment, not something to be denied on a priori grounds. 
In fairness to his critics, it must be said that Thorndike's insistence on a 
backward influence upon neurones encouraged such objections to the 
law of effect Some of his statements were indeed objectionable, but the 
objectionable statements never did express the essence of the ..law- of 
effect, which is essentially an empirical matter. 

Translated into more familiar words, Thorndike is saying in this 
law that rewards or successes further the learning of the rewarded be- 
havior, whereas punishments or failures reduce the tendency to repeat 
the behavior leading to punishment, failure, or annoyance. So much 
would merely be a reassertion of common observations. But he went 
further and insisted that the action of consequences is direct and need 
7 Thorndike ( 1913b), page 2. 
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not be mediated by ideas. In this insistence his law of effect anticipates 
the reinforcement principle adopted in many conditioned-response 
theories. The later changes in theory reduced the importance of an- 
noyers relative to satisfied and added some new phenomena, but the 
central importance of a modified law of effect persisted in Thorndike’s 
final statements of his position. 

Subordinate Laws 

The major law s of readiness, exercise, ^and effect were said to have 
five subsidiary laws app licable to both animal and hum an lea rning. 8 
Among these occurs one, associative shifting, which is so similar to one 
variety of conditioned response theory that it deserves special mention. 

In a short account of Thorndike’s views, the impression may be 
given that Thorndike was a very systematic writer. His "system,” apart 
from a few persistent preferences, is in fact a rather loose collection of 
rules and suggestions. What was called a "law” at any one time was a 
statement which at the time apeared to Thorndike to have some general- 
ity of application. No effort was made to retain internal coherence among 
the concepts used, or to establish any genuine relationship of co-ordina- 
tion or subordination among the laws. The five "subordinate laws” to be 
discussed are principles which seemed to Thorndike somewhat less im- 
portant than the major laws of readiness, exercise, and effect. They are 
not related to the major laws in any clear manner, and in later writings 
they were occasionally omitted, occasionally revived. 

1. Multiple response . The first of the five principles is that of multi- 
ple response or varied reaction. In order for a response to be rewarded, 
it must occur. When he faces a problem the learner tries one thing after 
another. When the appropriate behavior is stumbled upon, success fol- 
lows and learning is possible. Were the organism unable to vary its re- 
sponses, the correct solution might never be elicited. 

2. Set or attitude . The second principle is that learning is guided 
by a total attitude or "set” of the organism. Responses are determined 
in part by enduring adjustments characteristic of individuals raised in 
a given culture. But they are also influenced by more momentary ten- 
dencies. The attitude or set determines not only what the person will do, 
but what will satisfy or annoy him, Thorndike says that a more ambitious 
golfer will be annoyed by shots which the more modest would cherish. 
This principle is related to a series of conceptions coming to prominence 
later in discussions of level of aspiration. 

8 Thorndike (1913b), pages 23-31. 
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3. Prepotency of elements. The third principle states that the learner 
is able to react selectively to prepotent elements in the problem. That is, 
a man can pick out the essential item and base his responses upon it, 
neglecting other adventitious features which might confuse a lower 
animal. This ability to deal with the relevant parts of situations makes 
analytical and insightful learning possible. 

4. Response by analogy. The fourth principle is that of assimilation, 
or response by analogy. How does man react to novel situations? He re- 
sponds to a new situation as he would to some situation like it, or he 
responds to some element in the new situation to which he has a re- 
sponse in his repertory. Responses can always be explained by old ac- 
quisitions, together with inborn tendencies to respond; there is nothing 
mysterious about responses to novelty. 

5. Associative shifting. The fifth of these subsidiary laws is called 
associative shifting. The fundamental notion is that if a response can be 
kept intact through a series of changes in the stimulating situation, it 
may finally be given to a totally new stimulus. The stimulating situation 
is changed first by addition, then by subtraction, until nothing from the 
original situation remains. Thorndike illustrates by the act of teaching 
a cat to stand up at command. First a bit of fish is dangled before the 
cat while you say, “Stand up.” After enough trials, by proper arrangement, 
the fish may be omitted, and the oral signal will alone evoke the re- 
sponse. The most general statement of the principle of associative shift- 
ing is that we may “get any response of which a learner is capable as- 
sociated with any situation to which he is sensitive.” 9 This is obviously 
related to that type of conditioning in which the process is described 
as substituting a conditioned stimulus for an unconditioned one. Thorn- 
dike noted the similarity, but believed the conditioned response to be 
a more specialized case under the broader principle of associative shifting. 
While in his earlier writings associative shifting was but the fifth of 
the subordinate laws, in later books it was “promoted,” becoming a kind 
of learning second to that by selecting and connecting. 10 

Controlling Learning 

There is always some danger of misunderstanding a systematic 
writer's influence if attention is confined to the more abstract and gen- 
eralized laws whiepjhe proposes, to the neglect of some of the accessory 
details which givelboth flavor and body to his teaching. Thorndike as 

9 Thorndike ( 1913b , Ti page 15. 

10 ThnrnHilfA 1Q1_1Q7 
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early as 1913 was giving much more attention to the dynamics of learn- 
ing than a formal consideration of his laws suggests. 

Within the framework of his primary laws, he saw three considera- 
tions which affected the teacher's problem in using them in the class- 
room. 11 These were: (1) ease of identification of the bonds to be formed 
or broken; (2), ease of identification of the states of affairs which should 
satisfy or annoy; and (3) ease of application of satisfaction and annoy- 
ance to the identified states of affairs. The teacher and the learner must 
know the characteristics of a good performance in order that practice may 
be appropriately arranged. Errors must be diagnosed so that they will 
not be repeated. When there is lack of clarity about what is being taught 
or learned, practice may be strengthening the wrong connections as well 
as the right ones. At the same time, needed connections may be weak- 
ened by disuse. It is especially hard to teach imagination, force, and 
beauty in literary expression because it is difficult to be specific about 
the conduct which should be made satisfying at the time it occurs. The 
importance of specificity runs throughout Thorndike's writings. As we 
shall see later, this is at once a source of strength in his system and one 
of its points of vulnerability. 

But Thorndike's advice is not limited to the application of his major 
laws. He refers also to a number of motivational features not readily 
deducible from the laws of readiness and effect. Five aids to improvement 
he lists as the interest series. 1 - These he believes to be commonly ac- 
cepted by educators: 

1. Interest in the work 

2. Interest in improvement 

3. S'rvfi'v-co 

4. T .i > . : : , :!■.! -do 

5. Attentiveness 

To these five he added two more which he felt were open to some 
dispute. They were the absence of irrelevant emotion and the absence 
of worry. In his emphasis upon satisfiers and annoyers he is not talking 
about "crude emotional states," which he believes are to be avoided. 

In the case of improvement in skill, the balance turns again toward freedom 
from all the crude emotional states and even from all the finer excitements, save 
the intrinsic satisfyingness of success and a firm repudiation of errors which can 
hardiy be called exciting. 13 

11 Thorndike (1913b); pages 213-217. 

12 Thorndike (1913b), pages 217-226. 

« Thorndike (1913b), pages 226-227. 
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Thus to his rather harsh and brittle doctrine of specificity of con- 
nections he adds informal considerations which do much to temper it. 
The active role of the learner, who comes to the learning situation with 
needs and problems which determine what will be satisfying to him, is 
recognized implicitly in the commentaiy on the laws, although it lacks 
explicit statement in the laws themselves. It is probably these accessory 
features which have commanded ardent support by Thorndike’s fol- 
lowers, while it is the more abstract features which have been the focus 
of attack by those who have disagreed with him. 

The Identical-E lements Theory of Transfer 

Schools are publicly supported in the hope that more general uses 
will be made of what is learned in school. To some extent all schooling 
is aimed at a kind of transfer beyond the school. Whether the proper 
way to achieve this end may turn out to be to teach more formal subject 
matter, like mathematics and the classics, or to give more attention to 
practical subject matter like manual training and social studies, the 
problem is a central one for educators. 

Thorndike early interested himself in the problem. His theory began 
to take form in an experimental study done in collaboration with Wood- 
worth 14 and was formally stated in his early Educational psychology 
j^(1903). The theory proposes that transfer depends upon the presence 
j of identical elements in the original learning and in the new learning 
| which it facilitates. These may be identities of either substance or pro- 
cedure. Thus the ability to speak and write well are important in all 
schoolroom classes and many tasks of ordinary life. Hence mastery of 
\these skills will serve in different pursuits, and transfer will be effected 
through what the different situations require in common. The substance 
of what is required in different situations may be unlike, but there may 
be procedures in common. The procedures of looking things up in such 
diverse sources as a dictionary, a cookbook, and a chemists handbook 
have much in common, despite the unlike contents of the three kinds of 
book. If an activity is learned more easily because another activity was 
learned first, it is only because the two activities overlap. Learning is al- 
ways specific, never general: when it appears to be general, it is only be- 
cause new situations have much of old situations in them. 

Intelligence as jneasured bj tests may. be .thought of as to some ex-^ 
tent a measure of the transfer-capacity of an individual. That is, the test 
measures the ability to give right answers in relatively novel situations. 

14 Thorndike and Woodworth ( 1901 ). 
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It is logically sound that Thorndike’s theory of intelligence is, like his 
theory of transfer, a matter of specific connections. The more bonds the 
individual has which can be used, the more intelligent he is. 

Thorndike’s specificity doctrines of transfer and of intelligence have 
been highly influential and have led to a great deal of experimental work. 
Although the problems of the nature of and measurement of intelligence 
lie outside the scope of this volume, there will be occasion later to con- 
sider some of the alternative explanations of transfer. 

During the stable period of Thorndike’s system there were many 
changes in psychological climate, but these left him unruffled. The rise 
of behaviorism and the new importance attributed to the conditioned 
response affected him but little, because the new enthusiasts were talking 
a congenial language, even when they included him in their sweeping 
attacks on everything which preceded them. The attacks by die gestalt 
psychologists in the ’20’s were more telling, and he began later to meet 
some of their criticisms. But it was his own experiments which led him 
to come before the International Congress of Psychology in New Haven 
in September, 1929, widi the statement, “I was wrong.” He there an- 
nounced two fundamental revisions in his laws of exercise and effect 
which became die basis for a number of publications dating from 1930. 

CONNECTIONISM AFTER 1930 

The revisions of his fundamental laws were reported by Thorndike 
in a number of journal articles and monographs with various collabora- 
tors, the main results being gathered in two large volumes under the 
titles The fundamentals of learning (1932a) and The psychology of 
wants , interests , and attitudes ( 1935 ) . The law of exercise was practically 
renounced as a law of learning, only a trivial amount of strengthening 
of connections being left as a function of mere repetition. The law of 
effect remained only half true, the weakening effects of annoying conse- 
quences being renounced. For the two central laws there was substituted 
half the original law of effect. 

Disproof of the Law of Exercise 

The type of experiment used to disprove the law of exercise was 
that in which repetition went on under circumstances in which the law 
of effect could not be applicable. For example, repeated attempts to draw 
a line exactly 3 inches long while blindfolded did not lead to improve- 
ment, no matter how frequent the repetitions. Practice brings improve- 
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ment only because it permits other factors to be effective; practice itself 
does nothing- Thorndike intended that his “repeal*' of the law of exer- 
cise should be a safeguard against its misuse, not a denial of the im- 
portance of controlled practice. The laws of habit strengthening must 
be those of the conditions under which practice takes place; mere repe- 
tition of situations is not enough. If the person is informed each time 
after his attempt to draw a 3-inch line that his product is too long or too 
short, he will improve with repetition. 15 

The law of exercise was not, in fact, fully repealed. It is said that 
repetition of situations produces no change in strength of connections, 
but repetition of connections may produce a small advantage for that 
connection as against competing connections attached to the same situa- 
tionJThe strengthening is almost negligible; for all practical purposes, 
connections get strengthened by being rewarded, not by just occurring.) 
In one of those curious and optimistically quantitative summaries which 
Thorndike occasionally made, he concluded that a^ single occurrence 
followed by reward strengthens a connection about six times as much as 
it would be strengthened by merely occurring. 16 1 

r 

The Truncated Law of Effect 

■j A number of experiments yielded data showing that the effects of 
(reward and punishment were not equal and opposite, as had been im- 
plied in earlier statements of the effects of satisfiers and of annoyers. In- 
stead, under conditions in which symmetrical action was possible, re- 
ward appeared to be much more powerful than punishment. This con- 
clusion, if confirmed, is of immense social importance in such fields of 
application as education and criminology. 

One of these experiments was done with chicks. 17 A simple maze 
gave the chick the choice of three pathways, one of which led to "free- 
dom, food, and company” — that is, to an open compartment where there 
were othej- chicks eating. The wrong choices led to confinement for 30 
seconds. Statistics were kept on the tendencies to return to the preced- 
ing choice if it had led, to reward, and to avoid the preceding choice 
if it led to punishment. Thorndike interpreted his findings as follows; 
"The results of all comparisons by all methods tell the same story . Re- 

15 Trowbridge and Cason { 1932 ) . There is indeed more regularizing of performance 
in his own experiments than Thorndike believed, according to recomputation of his 
data by Seashore and Bavelas*( 1941). 

18 Thorndike (1940), page 14. 

17 Thorndike ( 1932b). 
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warding a connection always strengthened it substantially; punishing it 
weakened it little or not at all'.* 1B 

The corresponding* experiment with human subjects consists of a 
multiple-choice vocabulary test. For example, a Spanish wordjs given 
with five English words, one of which is its correct translation. A second 
and a third word follow, and so on through a list, each word with alterna- 
tive translations arranged in the same manner. The subject guesses the 
word which is correct, underlines it, and then hears the experimenter 
say Right (‘rewarded” response) or Wrong (“punished” response). How 
will he change his responses the next time through the list? As with the 
chicks, reward leads to repetition of the rewarded connection, but punish- 
ment does not lead to a weakening of the punished connection. In six 
experiments of this general sort, Thorndike concluded that the announce- 
ment of Wrong did not weaken connections enough to counterbalance the 
slight increase in strength gained from just occurring . 19 

Thorndike and his staff went on to collect a series of testimonials 
about the relative efficacy of rewards and punishments from published 
biographies and other sources, going back many years. The almost uni- 
versal evidence of the greater beneficial effect of reward than of punish- 
ment gives practical support to the findings of the experiments, which 
otherwise would be criticized as too far removed from ordinary life . 20 

There were some statistical difficulties in Thorndikes interpre- 
tations of his data which caused him to underestimate the significance 
of punishment. On the whole, however, he probably did a service through 
calling attention to the asymmetry of the effects of reward and of punish- 
ment. 

As in the disproof of the law of exercise, the repeal of the principle of 
weakening by annoying after-effects is not absolute. It is only direct weak- 
ening which is denied. Punishments do, according to Thorndike, affect 
learning indirectly. Thorndike says that their indirect effect comes 
chiefly from leading the animal to do something in the presence of the 
annoyer which makes him less likely to repeat the original connection. 
But this is not necessarily the case. 

An annoyer which is attached to a modifiable connection may cause the 
animal to feel fear or chagrin, jump back, run away, wince, cry, perform the 
same act as before but more vigorously, or whatever else is in his repertory as a 
response to that annoyer in that situation. But there is no evidence that it takes 

18 Thorndike ( 1932b ), page 58. 

19 Thorndike (1932a), page 288. 

so Thorndike (1935), pages 135-144; 248-255, 
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* away strength from the physiological basis of the connection in any way com- 
* parable to the way in which a satisfying after-effect adds strength to it* 21 

Thorndike was less successful in his attempts to explain the action 
of effect than in demonstrating that there are phenomena to which his 
principles apply. He distinguished between a direct confirming influence 
and the informative influence of rewards. Control of behavior according 
to the information supplied by its consequences implies mediation by 
ideas of the sort, “If I do this, I get fed; if I do that, I get slapped.” 
Thorndike believed that he kept this kind of deliberation at a minimum 
in his experiments, so that what he had to explain was the direct con- 
firmatory reaction which he said was responsible for the strengthening 
of responses through reward. This confirming reaction is vaguely de- 
scribed as an “unknown reaction of the neurones” which is aroused by 
the satisfier and strengthens the connection upon which it impinges . 22 
The confirming reaction is said to be independent of sensory pleasures, 
and independent of the intensity of the satisfier. It is highly selective, 
depending upon aroused drives or “overhead control in the brain.” While 
such an account is far from satisfactory, it at least helps to show where 
Thorndike stood. He was against mediation by ideas, as an interpreta- 
tion of effect according to information would imply. At the same time, 
he recognized the complexity of the reinforcement process and was not 
committed to a simple hedonism. The law of effect was for him no longer 
a law of affect, as Hollingworth once named it . 28 

Belongingness 

In addition to the revisions of the laws of exercise and effect, several 
new terms entered as Thorndike's system was revised. One of these, 
belongingness , by its recognition of an organizational principle foreign 
to the structure of Thorndike's theory of specificity and mechanical ac- 
tion, made concessions to the gestalt psychologists . 24 According to this 
principle, /a connection is more easily learned if the response belongs to 
the situation, and an after-effect does better if it belongs to the connec- 
tion it strengthens.^ Thus a series of sentences may be read, each of the 
form “John is a butcher. Henry is a carpenter.” The association John- 
butcher is a stronger one following such a reading than the association 
butcher-Henry, even though the latter connection is based on more nearly 
contiguous items. The reason is that a subject and predicate belong to- 

21 Thorndike (1932a), pages 311-313. 

22 Thorndike ( 1933c ) . 

23 Hollingworth ( 1931 ) . 

24 The point was made early by Brown and Feder ( 1934). See also Guthrie ( 1936b), 
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gether in a way in which the end of one sentence and the beginning of 
another do not. The belongingness of a reward or punishment depends 
upon its appropriateness in satisfying an aroused motive or want in the 
learner, and in its logical or informative relationship to the activities re- 
warded or punished. Thus to be rewarded by having your thirst quenched 
when you lift a glass of cool water to your lips is reward with belong- 
ing. If the same series of movements led sporadically to an electric shock 
on your ankle, that would be punishment without belonging. Although 
Thorndike states that after-effects are influential without either belong- 
ingness or relevance, he points out at the same time that they are more 
effective when they do belong and when they are relevant. 25 

While the principle of belongingness may be interpreted as something 
of a concession, the principle of polarity is emphasized as defying gestalt 
principles. 26 The principle of polarity is that connections act more easily 
in the direction in which they were formed than in the opposite direc- 
tion. If you have learned the items of a German-English vocabulary, it 
is easier to respond to the German word by its English equivalent than 
to the English word by its German equivalent. If a connection is thought 
of as a new whole, Thorndike contends, the polarity principle ought not 
be important. It ought then to be as easy to dissociate parts from the 
whole in one direction as in another. 

Other new terms introduced, such as “impressiveness," “identifi- 
ability,” “availability," and “mental systems," represent informal exten- 
sions of notions already latent in the earlier writings 

Discovery of the Spread of Effect 

In 1933 a new kind of experimental evidence was offered in support 
of the law of effect, evidence described as the spread of effects The 
experiments purported to show that the influence of a rewarding state 
of affairs acts not only on the connection to which it belongs but on 
adjacent connections both before and after the rewarded connection, the 
effect diminishing with each step that the connection is removed from 
the rewarded one. The effect acts to strengthen even punished connec- 
tions in the neighborhood of the rewarded one. (The experiments lent sup- 
port to the automatic and mechanical action of effect. A characteristic 
experiment was that in which the subject was asked to state a number 
from 1 to 10 following the announcement of a stimulus word by the 

25 Thorndike (1935), pages 52-61. 

26 Thorndike (1932a), page 158. 

2 7 Thorndike ( 1933a ) ( 1933b ) . 
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experimenter. The experimenter then called his response Right or Wrong , 
these “rewards” and “punishments” conforming either to a prearranged 
assignment of correct numbers to each word, or to some systematic pat- 
tern of “rights” or “wrongs.” In either case, the assignment of numbers 
from the point of view of the subject is arbitrary, and the cue to re- 
peat the number first assigned or to change it comes from what the ex- 
perimenter says following each word. The lists are so long that the sub- 
ject cannot recall on the second trial just what was done on the first one. 

After the list lias been read a number of times in this manner, the 
responses of the subject are classified to find the number of times the 
response was repeated. Not only are the rewarded responses repeated 
more often than the others, but responses called Wrong are repeated be- 
yond chance expectancy if they occur near in ' time to a response called 
Right. To some extent the phenomena included in the spread of effect 
come nearest to a “discovery” in the whole of Thorndike’s work. Because 
of the novelty of the phenomena and their systematic relevance, typical 
experiments in support of and critical of the spread of effect are re- 
viewed below as illustrative of Thorndike’s influence upon experimen- 
tation in the field of learning. 

EXPERIMENTS ON THE SPREAD OF EFFECT 

The spread of effect is the most characteristic feature of contempo- 
rary connectionism. It illustrates at once both the relative potency of 
rewards and punishments and the semi-automatic manner in which ef- 
fects act upon connections, whether or not they “belong.” To the extent 
that the phenomena of the spread of effect stand upon firm ground, 
Thorndike’s basic conceptions are buttressed against attacks by his critics. 

Punishment More Effective Than Thorndike Believed 

Thorndike had already changed his mind about the effectiveness of 
annoyers in weakening connections before the spread-of-effect experi- 
ments were announced. He interpreted his results on the spread of ef- 
fect as confirming his recently, acquired belief in the relative ineffective- 
ness of punishments in weakening connections. The spread of effect is 
then, to him, a spread of positive effects, a gradient of reward. Reward 
is said to strengthen even neighboring punished connections." ’ 

In several of his experiments, Thorndike made a faulty assumption 
about the baseline of chance expectation. He assumed that one alterna- 
tive in a series of multiple possibilities was as likely to occur as another. 
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That is, if there were four choices, by chance each should occur 25 per 
cent of the time. When the opportunity is given to repeat the responses, 
the same one chosen last time should be chosen again in 25 per cent of 
the cases, if the tendency to respond has been neither strengthened nor 
weakened during the first occurrence. In practice this chance repeti- 
tion seldom occurs, for whatever predisposition leads to a preference 
for one of the choices the first time tends to favor that same choice the 
next time. Suppose there are four alternatives, and the series is presented" 
twice. If there is a 35 per cent agreement on choices between the first and 
the second time, this does not mean that the gain over chance resulted 
from repetition. The above-chance preference may have been there on 
the first trial. This criticism has been made by Stephens (1934) and Hull 
(1935d), among others, and empirical results show that punishment does 
in fact lead to fewer repetitions. The result of Thorndike’s use of too 
low a base for chance repetition was automatically to assign too much 
strengthening to rewards and too little weakening to punishments. The 
asymmetry could be entirely a statistical artifact. 

Tilton, a former associate of Thorndike, repeated the spread-of-effect 
experiment with careful controls to determine what the empirical level 
of repetition would be without the saying of Right and Wrong , and 
then proceeded to plot the spread of effect on either side of a rewarded 
and a punished response. 28 He made a correction also for a serial posi- 
tion effect, that is, for a tendency to repeat the same response to items 
near the beginning and the end of the list. When correction is made, it 
is found that the effects of Right and Wrong are about alike, the an- 
nouncement of Wrong decreasing repetitions about as much as Right in- 
creases them. A replotting of Tilton’s results is shown in Figures 1 and 2. 

Tilton’s study shows that there is a tendency for punished responses 
in the neighborhood of reward to be repeated more frequently than such 
responses remote from reward. Their punishment (being called Wrong) 
suffices however, even one step from reward, to lead to less repetition 
than would occur if the response were neither rewarded nor punished 
(Figure 1). Similarly, when a response called Wrong (punished) oc- 
curs in the midst of a series of rewarded responses, the neighboring re- 
warded responses are repeated less frequently than they would have 
been had they not been in the neighborhood of the punished response. 
Again, however, their reward (being called Right ) is enough to lead 
to their repetition at a greater frequency than that represented by the 
neutral baseline (Figure 2). 
ss Tilton (1939) (1945). 
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Because Tilton used the conventional method of obtaining spread of 
effect, repeating lists over and over again, and allowing right responses 
and wrong responses to appear with any degree of separation, he had to 
make a rather elaborate analysis in order to exhibit just what was in 
fact happening within his experiment. Martens (1946) introduced a cor- 
rective by arranging the experimental situation so that each position 
before and after reward has an equal opportunity to be represented in 
the final spread of effect pattern. This she did by limiting the training 
and testing to one trial each on each list. That is, on one trial through 



PRECEDING RESPONSE FOLLOWING PRECEDING RESPONSE FOLLOWING 
FIG. 1 FIG. 2 

FIGURE 1. Gradient of effect around a Right response. Calling a response Right 
increases its repetition, calling a response Wrong decreases its repetition; but the 
decrease is less when the Wrong is near to an isolated Right . Redrawn from Tilton 
(1945). 

FIGURE 2. Gradient of effect around a Wrong response. Calling a response 
Wrong decreases its repetition, calling a response Right increases its repetition, but 
the increase is less when the Right i s near to an isolated Wrong. Redrawn from Tilton 
(1945). 

a list of words the subject arbitrarily assigned numbers and was informed 
each time whether the number was right or wrong. The experimenter was 
able to assign the word Right to prearranged positions, all other posi- 
tions being assigned the word Wrong . Then a single additional trial was 
used to determine the extent to which the subject would repeat the num- 
bers called Right and repeat or change the numbers called Wrong . The 
statistical handling and interpretation of the effects of Right and Wrong 
are greatly simplified, for according to the prearranged pattern there, 
are an equal number of opportunities to give the same response or a 
different response to each of several positions before and after the re- 
warded connection. The control situation consisted in the subject s assign- 
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ing numbers as in a free association experiment, with the list repeated 
twice, no announcements of Right or of Wrong being used. 29 Her re- 
sults in this modified situation supported Tilton's, so far as the effect 
of a rewarded response imbedded in the midst of punished responses is 
concerned. That is, the effect of Wrong , , even adjacent to the rewarded 
connection is greater than whatever influence can be attributed to the 
spread of the effect of Right The direct effects of both Right and of 
Wrong are greater than any spread from one to the other. 

Stephens (1941), like Tilton, a former student of Thorndike, has 
madeT~ an interesting mediating suggestion J hat “symbolic" reward 
str engthens weak connections more than “symbolic" punishment wea k- 
ens them, but :_syinbolic pimish merit w e akens strong connections mo re 
than"s^bohcrew ard strengthens the rmy According to this view, it was 
because Thorndike tended to work mostly with weak connections that 
he got the results he did. Stephens' experiment was done by using state- 
ments such as appear on attitude scalesjo determine certain convictions 
held strongly by the subjects, ^others held less strongly. Then a law-of- 
effect experiment was arranged, in which multiple-choice answers were 
permitted to selected statements, and the subject was informed which 
choice was correct. The method of information was the ingenious one! 
of using chemically prepared paper so that the correct response came 
out in color when it was marked by the instrument provided. This per- 
mitted the experiment to be done with groups. When the experiment was 
.then repeated, it was possible to see which tendency was the greater, to 
repeat the response answered correctly or to change from the response 
answered incorrectly. The change was found to be correlated with the 
^conviction represented by the answer. The better-established replies were 
more influenced by punishment, the less-well-established by reward. 

There is a difficulty of interpretation in this experiment which may 
help to account for uncertainty in other experiments about the relative 
influence of reward and punishment. There is a logical difference be- 
tween responding in the intelligent direction to Right and to Wrong . 
The intelligent response to Right is to do again what was last done. This 

29 The "empirical chance” or "neutral” level of repetition depends upon the in- 
structions within the control period. Martens' method, which sought "natural” associates 
between words and numbers, as in the free-association experiment, led to a higher 
baseline of , repetition than Wallach and Henle’s (1941) method, which may have 
mildly discouraged repetitions. The problem of level of repetition within different 
kinds of f.vhs u,.s later studied by Sheffield and Jcnldr.c (1952), whose results are 
reported further on. The interpretation of the rclilixc ellic.ic) of reward and punish- 
ment depends upon the assumed baseline, and if there is arbitrariness about the base- 
line there must be arbitrariness in the interpretation of positive and negative effects. 
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makes possible immediate rehearsal; the task is clear. The intelligent re- 
sponse to Wrong is to do something different, but what to do is less 
clear. It is necessary both to remember what not to do and to form some 
sort of hypothesis as to what to do. Under time pressure this vagueness 
might well produce an asymmetry between responses following Right 
and Wrong. In the case of poorly established convictions, it may be 
harder to remember what was said that was called Right than in the 
case of well-established convictions to remember what was said that 
was called Wrong. If that were true, Stephens’ results would follow. 
Anything which makes it easier to make the necessary discriminations 
in one case than in another will effect the relative potency of hearing 
Right or Wrong , and of modifying subsequent behavior accordingly. In 
other words, the situation is not nearly as mechanical and stupid as it 
appears; in fact, the results can be best understood on the assumption 
that the individual is doing his best to act intelligently under somewhat 
confusing circumstances. 

The Theory of Discrete Connections Probably Faulty 

The spread of effect as Thorndike describes it presupposes a series 
of stimulus-response connections with no other organization among them 
than succession in time. What is strengthened (or weakened) is the tend- 
ency for a stimulus to be followed by the response which accompanied 
it the last time. Other experimenters have given evidence that additional 
principles of organization do apply, and the automatic spread of the in- 
fluence of reward to neighboring connections is either incorrect or a gross 
oversimplification. 

Zirkle (1946a), by changing the order of stimulus words on suc- 
cessive trials, showed that what was repeated was a neighboring response, 
/not a stimulus-response connection. That is, a tendency to repeat the 
response "five” given originally to the word “youthful* two steps removed 
from reward, was found on the next trial to be given to “supports” as 
the stimulus word in the cor respond ing position. There was no increase 
in the tendency to say “five” after “youthful” unless it appeared in its old 
--place. Under the usual arrangements of the experiment on spread of ef- 
fect, it would be impossible to distinguish between strengthening a re- 
sponse or a connection, but under the special circumstances of Zirkle’s 
experiment it is clear that response is the more important beneficiary of 
the reward. 

This suggestion by Zirkle that it is the response rather than the con- 
nection which enters prominently into spread-of-effeet data receives fur- 
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ther support from the experiments of T enkins and Sheffield (1 946). They 
show that guessing habits are important in the usual spread-of-effect 
experiments. Guessing habits refer to patterns of response, not to iso- 
lated stimulus-response preferences. They found that if a rewarded re- 
sponse was repeated, othe r re sponses followi ng it were also repeated; if 
a re warded respon se was not rep eated, ot her responses in its^neighbor- 
hoo d were not r epeated beyond chance levels. Hen ce the repetition of a 
re warded resp onse a ppears t o be more im portant than the fact o f re- 
ward. The major effect of reward is to lead to repetition of the correct 
response; once this success is achieved, the repetition of responses after 
the reward depends upon guessing habits. 80 



FIGURE 3. Distribution of steps from the first*to the second occurrence of a digit 
from “one” through “six” in a random series and in a “subjectively random” series 
produced by 15 subjects. Note that the subjects typically avoid using the same digit 
a second time until other digits have been used. From Smith (1949), page 360. 

A related but supplementary suggestion has been made and verified 
by M. H. Smith, Jr. (1949). In what he calls the ^ probabi lity-bias^hy- 
pojhesis,” he points out that subjects in attempting to respond with a 
ranaoSnr series of digits tend to avoid using the same digit over again 
until most of the digits have been used up. In a chance series, the digit , 
has an equal chance of coming up on each trial; its appearance on any 
one trial is not influenced by the fact that it may have occurred on the 

30 The guessing sequence hypothesis has been amply supported through additional 
work by the same authors, their collaborators, and others: e.g., Jenkins and Cunning- 
ham (1949), Sheffield (1949b), Fagan and North (1951;, Sheffield and Jenkins 
(1952). 
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preceding one. Intervals between two occurrences of the same digit in a 
‘random” series are plotted in Figure 3 along with the intervals between 
two occurrences in a "subjectively random” series. The striking differ- 
ence between the curves arises because the subjects, in attempting to 
'produce a random series, postpone the recurrence of a digit once used. 
How can this bias against repetition produce a spread of effect? The 
chance of repeating a digit used on the preceding trial following a cor- 
rect (Right) response is enhanced by the avoidance of the digit just 
used as the "correct” response. While this effect is slight, so is the amount 
of the spread of effect as commonly found. A consistent slight bias of 
this kind is enough to produce statistically significant results. 

There are thus seen to be organizational factors in the spread of 
effect not described simply by remoteness of connections from rewards. 

Is “Reward” the Basis for the Enhancement Effect? 

The presumption is made in Thorndike’s experiments that the repe-^ 
tition of the response called Right results directly fron Lreward . That this 
may not be the case is shown in the experiments of Wallach and Henle 
(1941, 1942). The subject believed himself to be in an experiment on 
extrasensory perception, so that as he went through a list he was trying 
to assign the number which had been assigned by someone else for that 
particular trial. Hence a response called Right on one trial might be in- 
correct on the next trial. There was thus no intent to learn the Right 
responses, and in fact the hearing of the word Right did nothing to im- 
prove the repetition of either the rewarded response or its near neighbors. 
It appears that what the subject is trying to do is more important than 
one would suppose from Thorndike’s insistence on thejaitoinaticitv of the 
e ffect of rewa rd. 

Another of Zirkle’s experiments 31 bears importantly on the question 
of what causes the repetition of the response called Right. Influenced 
by the possibility that something like a figure-ground relationship may 
be effective in these situations, Zirkle arranged to have three degrees 
W isolation of the response called Right. It was his conjecture that 
with greater isolation there would be greater repetition of this response, 
and perhaps of its near neighbors also.' The degrees of isolation were 
obtained by inserting within a list of black lower-case words as the words 
to be called Right (a) a nonsense syllable in jed jDapitaL letters, the 
highest degree of isolation, (b) a word in black capital letters, the next 
degree of isolation, and (c) a word printed the same as all the other 
31 Zirkle (1946b). 
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words, in black lower-case letters, but isolated solely by being called 
Right in the midst of consecutive words called Wrong. The results are 
plotted from Zirkle's data in Figure 4. His prediction was fully con- 
firmed. Not only was the most isolated response most regularly repeated, 
but its repetition was followed by the enhanced repetition of the words 
immediately following. Because the spread of effect was not demon- 
strated for the words immediately preceding the isolated word, it is 



FIGURE 4. Gradient of effect around Right responses with three degrees of 
isolation. All isolated responses called Right, all ~ * called Wrong. 

The differences among the three curves depend tp . .! , , »*. , - in degree of 

isolation of the items called Right . The most isolated item, a nonsense syllable in red 
capital letters; the medium isolated item, a word in black capital letters; the least 
isolated item, a word piinted like all other items, in black lower-case letters, isolated 
only by being called Right in the midst of words called Wrong. Plotted from Zirkle 
(1946b). 

probable that isolation led to the repetition of the isolated word, and 
that guessing habits then produced the after-gradient. This conjecture 
is indirectly supported by the experiments of Jenkins and Postman 
(1948), in which they found no gradient around an isolated stimulus 
when guessing sequences were eliminated. 

In his experiment on spread of effect when there was neither reward 
nor opportunity for learning, Sheffield (1949b) found the typical “spread,” 
but the level of repetition waslower than in the usual Thorndike experi- 
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ment. While Thorndike S2 had found the average per cent of repetition 
for the position following reward to be 20.4, the corresponding per cent 
m Sheffields experiment was 13.3. Was there after all some effect of “re- 
ward,” or can the difference in level be accounted for on other grounds? 

Sheffield and Jenkins (1952) set out to determine whether or not 
the reward was responsible for the enhancement in Thorndike’s experi- 
ments. They continued to use Sheffield’s procedure of having subjects 
assign numbers at random, without any statement of Right or Wrong or 
any instructions to attempt to repeat. But the numbers were assigned 
while the subject was engaged in irrelevant learning tasks, approximat- 
ing the kinds of tasks used in Thorndike’s experiments. Depending upon 
the task used, percentages of repetition equal to and greater than those 
found by Thorndike could be produced — all without reward and any 
learning related to the number sequences. A few of the results are sum- 
marized graphically in Figure 5. A change in instructions, and the con- 
cealing of the record of previous replies, raised the level of repetition, 
without any new task being introduced; when an irrelevant learning task 
was used, with a fixed order of stimulus presentation (typical of Thorn- 
dike’s practice), the repetition of other ordered stimuli led to an en- 
hanced repetition of the supposedly random numbers; finally, a paired- 
associates task in which the assigning of a random number followed the 
attempt to recall the response member of the associated pair led to the 
highest level of repetition of numbers on succeeding trials. These experi- 
ments lend further support to the hypothesis, and ap- 

pear to make unnecessary the attributing of the spread of effect to a con- 
trol by way of reward. 

One dissent in recent years from the negative findings on the spread 
of effect is that of Duncan (1951). Aware of the many criticisms of the 
spread of effect, he designed an experiment in which the stimulus - member 
of the associated items would again be important. (It must be remem- 
bered that most of the criticisms center around the fact that the spread 
of effect is generated by response-habits having little to do with the na- 
ture of the stimuli presented. ) To this end he used two sets of stimulus 
words. One set consisted of unrelated adjectives (“ancient” “doymcast,” 
“mutual”) while the other consisted of adjectives very similar in mean- 
ing (“superb,” “choice,” “unique”). Duncan suggested.that saying Right 
to a number assigned to one of the similar adjectives might very well 
lead to a repetition of a number assigned to another similar adjective. 

33 Thorndike (1933b). The experiments averaged for comparison with Sheffield's 
are Experiments F, H, and I, 
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k In other words, the generalization tendencies might lead to an enhance- 
ment of the spread of effect, through uncertainty as to the exact locus of 
the reward. By the use of the control adjectives of dissimilar meanings, 
presumably guessing tendencies and the like would be balanced out. 
His results were largely ambiguous, but the one statistically significant 



FIGURE 5. Modified conditions of experimentation raising the level of repetition 
in a “spread-of -effect” type of experiment without reward or relevant learning. Plotted 
from the data of Sheffield and Jenkins (1952). 

difference he found favored his hypothesis and thus gave some slight 
support to Thorndike’s position. Perhaps the main lesson to be learned 
is that it is very difficult to design the finally decisive experiment, and 
despite all its critics the spread of effect is not yet permanently de- 
molished. 33 

The suggestion from Duncans experiment that uncertainty as to 

83 There are some unfortunate weaknesses in Duncan’s experiment that prevent its 
making a strong case for keeping the spread of effect alive. For example, he finds more 
of a gradient (more repetition, and more difference between the two adjacent positions) 
following a “rewarded” response not i c pealed than following a “rewarded” response that 
is repeated. Perhaps his too-similar adjectives have displaced the action of reward 
rather than spread it! 
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what was rewarded may have something to do with the spread of effect 
receives indirect confirmation from experiments by Nuttin ( 1949 ) ( 1953 ) . 
The subject was given a list of 40 well-known stimulus words. Each 
word was followed by five two-syllable words of an ‘unknown language” 
(actually artificially constructed nonsense words). He was asked to 
choose the word that was the exact translation of the stimulus-word, and 
then told whether he was Right or Wrong. The rights and wrongs were 
assigned in a prearranged manner. The list was then repeated a second 
time, but now the subject was asked to repeat the reply he had given the 
first time, adding whether the choice had been called Right or Wrong . 
^Nuttin found that wrong items next to ones called Right were remem- 
bered as having been called Right more often than those farther re- 
moved. jThere is thus some subjective uncertainty as to the locus of the 
experimenter’s assigned Rights and Wrongs . (Correct responses in the 
neighborhood- of a Wrong were also often remembered as having been 
called Wrong.) 

Some Methodological Lessons from the Spread-of-Ejfect 

Experiments 

The spread-of-effect experiments stand as a warning of the many 
pitfalls confronting the experimenter seeking conscientiously to design 
experiments relevant to theory. The spread-of-effect experiment epito- 
mized Thorndikes final system: discrete connections, automatic ( “bio- 
logical” ) influence of rewards, ascendancy of reward over punishment. 
The experimental results were objective, quantitative, reproducible by 
others. Thorndike was so pleased with these experiments as a proof of 
the validity of his system that he turned a deaf ear to the critics. In 
preparing the collection of his papers for his final book he inserted this 
comment regarding the criticisms of the spread-of-effect experiments: 
‘There are a few recent dissenters, but if they will repeat the original 
experiments in full I prophecy that they will be convinced.” 34 

The causal factors in any learning experiment are complex. The 
positive contribution of the experiments critical of Thorndike has been to 
call attention to a number of these factors that he either overlooked or 
slighted in his description and explanation of the spread phenomenon. 
Some of these are: 

1. The phenomena are influenced by the set or instructions that de- 
termine what the subject is trying to do. 35 

34 Thorndike (1949), page 27. 

35 Wallach and Henle (1941). 
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2. The learning task undertaken may be irrelevant to the spread 

phenomena, yet influence the level of repetition. 36 • 

3. An isolation or emphasis effect is as important as reward in de-‘ 
termining which response is most frequently repeated. 37 

4. Guessing habits result in a gradient of response around the focal 
response that is repeated for any reason whatever. 88 

5. A probability bias operates so that a subject repeats a digit at 
less than chance frequency when attempting to respond in random 
fashion. This avoidance of one or more previously used digits leads to a 
beyond-chance repetition of the remaining digits. 30 

6. Integration of serial acts through anticipatory responses, remote 
associations, etc., produces types of gradient superficially similar to, but 
not causally identifiable with, the spread of effect. 40 

If errors enter because of these six causal factors, they are errors 
in explanation or interpretation. They do not in themselves imply poor 
experimentation, for the factors may themselves be systematically varied 
and their effects studied. 

There are, in addition, errors in the treating of data that led to 
systematic distortions of the results as these results were reported from 
many of the earlier experiments on spread of effect that attempted to do 
just what Thorndike recommended, that is, to repeat his experiments in 
full. Three of these are worth noting: 

1. A faulty baseline of chance was selected, on the assumption that 

empirical chance levels are equal to computed random levels. 41 This 
error led to an underestimation of the influence of punishment (saying 
Wrong). * 

2. Improper weighting of steps in the gradient by scoring responses 
between two rewarded responses ( Rights ) as both preceding and follow- 
ing a Right.* 2 This error led to a greater symmetry in the "before" and 

36 Sheffield and Jenkins ( 1952 ) . 

S7 Zirkle (1946b). The possible role of emphasis was earlier suggested by Tolman, 
Hall, and Bretnall ( 1932 ) . The isolation effect came to Zirkle’s attention through the 
work of von Restorff (1933). Strengthened response through isolated punishment, 
noted by Zirkle, has been confirmed by Stone ( 1953). 

38 Jenkins and Sheffield ( 1946 ) . The suggestion was earlier made by Tolman ( 1936), 
in a review of one of Thorndike's books, but he did not follow it up. 

89 Smith ( 1949 ) . 

40 Jenkins ( 1943 ) found data apparently supporting the spread of effect in a maze 
experiment with rats, but anticipatory and perseverative tendencies make unsuitable an 
interpretation in terms of spread of effect to discrete connections. We shall later meet 
other gradients, such as the gradient of reinforcement, which have only a family 
resemblance to spread of effect. 

41 Pointed out by Stephens ( 1934), Hull ( 1935d), Tilton ( 1939). 

42 Pointed out especially by Tilton ( 1939) ( 1945). 
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"after” gradients than has been found in later experiments avoiding this 
error. 

3. Repeating lists many times in the same order, thus increasing the 
proportion of Right responses through learning and reducing both the 
prevalence of Wrong responses and their distance from the nearest Right . 
As a consequence the Wrong responses that remain are predominantly 
those which (for any reason) are stereotyped and resistant to change. 
Because late in practice these are necessarily near to Right responses, it 
will appear statistically (but spuriously) that Wrong responses are re- 
peated because of their proximity to Right ones. 43 

It is seldom that a single variety of experiment is as crucial to 
a systematic theoretical position as the spread-of-effect experiment was 
to Thorndike’s. For that reason its criticisms have been more pertinent 
than the criticisms of less important experiments. That the criticisms have 
also been more devastating is merely a matter of fact. Had the spread of 
effect been confirmed by repeated experiments under appropriate con- 
ditions, its explanation would have been accepted as an impressive con- 
tribution to psychological theory. 

ESTIMATE OF THORNDIKE’S POSITION 

Before proceeding to an appraisal of the contemporary significance 
of Thorndike’s position, his answers to the standard problems of learning 
will be summarized. By attempting such a summary for each of several 
leading learning theorists it will be easier to keep perspective on their 
similarities and differences. 

Thorndike's Position bn Typical Problems of Learning 

Thorndike’s answers are briefly summarized according to the six 
typical problems discussed in the first chapter. 

1. Capacity . Learning capacity depends upon the number of bonds 
and their availability. The differences between bright and dull are quan- 
titative rather than qualitative, although intelligence has dimensions of 
altitude as well as of breadth. 44 The theory of intelligence is consonant 
with the identical-elements theory of transfer. 

2. Practice . Repetition of situations does not in itself modify con- ' 
nections. Repetition of connections leads to a negligible increase in 

43 This criticism, although fairly obvious, was first noted in the earlier edition of this 

book. ^ 

44 Thorndike and others (1927). 
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strength, unless the connections are rewarded. Practice is important be- 
cause it permits rewards to act upon connections. 

3. Motivation. Reward acts directly on neighboring connections to 
strengthen them; punishment has no corresponding direct weakening ef- 
fect. Punishment may work indirectly, however, through making the 
learner do something else which may confront him with a reward. “Ideas” 
need not intervene; connections may be strengthened directly, without 
awareness. 

4. Understanding. The role of understanding is minimized, not be- 
cause it is undemonstrable, but because it grows out of earlier habits. 
The best way to get understanding is to build a body of connections ap- 
propriate to that understanding. When situations are understood at once 
it is a matter of transfer or assimilation, that is, there are enough elements 
in common with old situations to permit old habits to act acceptably. 

5. Transfer. The theory of identical elements is espoused. Reaction 
to new situations benefits by the identity of these new situations, in part, 
with old situations, and also by a principle of analogy described as as- 
similation. 

6. Forgetting. The original law of disuse assumed forgetting to take 
place without practice in accordance with the empirical findings of studies 
such as those of Ebbinghaus. Later books did not deal with the problem 
in any detail; the law of disuse was not mentioned, but some decay with 
no practice was still implied. 

The flavor of Thorndikes theory was all along that of the automatic 
strengthening of specific connections — directly, without intervening ideas 
or conscious influences. While Thorndike was not an avowed behaviorist, 
and was willing occasionally to use subjective terms, his emphasis was 
certainly behavioral. It would be unfair to leave the discussion of Thorn- 
dike without referring again to- his insistence on measurement, and 
through that insistence his contribution to the improvement of the learning 
of skills in the schools. There was an energetic empiricism about Thorn- 
dike’s experimenting and theorizing which compensated for its lack of 
systematic elegance. 

The Specificity Doctrine a Source Both of Strength 
and of Weakness 

Thorndike gave great impetus to what has sometimes been called 
the scientific movement in education — the movement which suggests that 
educational practices be regulated according to verified outcomes of 
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specific practices. His tremendous drive led to enormous output in fields 
as varied as handwriting scales, dictionary-writing, methods in teach- 
ing arithmetic and spelling, intelligence tests, and vocational guidance. 
But the secret of his output was not only energy: the output stemmed 
also from his matter-of-fact conception of science, that in order to do 
something about anything you have to know specifically what you are 
about. 

The specificity doctrine helps you to roll up your sleeves and get to 
work. Consider, for example, all the complications involved in the teach- 
ing of reading. What is it that the child is to be taught? Philology? Gram- 
mar? Semantics? It took a Thorndike to give the simple answer: "Words/ 5 
With that answer he proceeded to count the frequency with which each 
word occurs in English, by tabulating millions of printed words from all 
manner of sources. He then arrived at the most common words. These 
are the words which must surely be understood. He made available lists 
and dictionaries to facilitate teaching the most needed words. A speci- 
ficity theory like Thorndike's Tells the educator where to look and how to 
measure in a baffling field-iuct as schoolroom practices. 

The specificity doctrine i£ also a source of weakness, and it has been 
the target of the most severe attacks upon Thorndike. The illustration 
above shows the kinds of criticisms which Thorndike invites. Is language 
no more than words? Are the most frequent words really what we wish 
to teach? Perhaps we need to think of language as a means of expres- 
sion, as logic in action, and must therefore equip the child with the 
minimum set of tools necessary for adequate communication. That this 
approach is a possible one has been shown by the development of Basic 
English, wherein the central vocabulary of 850 words overlaps only in 
part with Thorndike's most frequent words. 45 The approach of Basic 
English takes into account the organized character of language as an 
instrument of meaning. Thorndike, true to association tradition, tended to 
think of language as a collection of words, which he set out to treat 
quantitatively. 

In Thorndike the analytical emphasis of all association theory also 
pervaded the conception of rewards and punishments, and weakened 
somewhat the analysis of these phenomena. The notion that the law of 
effect works mechanically on all connections in the neighborhood of the 
rewarded one makes of reward something extrinsic to the activity in 
question, something pinned on adventitiously. In showing that the law 

45 E.g., Richards (1943). The basic list of 850 words includes some words in the 
seventh thousand of Thorndike's count, such as advertisement and sneeze. 
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of effect may work this way, he slighted the internal relationships be- 
tween success and what the individual is trying to do, goals which satisfy 
aroused motives or needs. Again, texts may be cited in proof of the fact 
that Thorndike knew all this — he early quoted the 4 interest series" as 
we have shown and later added the notion of “belongingness” — but that 
does not alter the conclusion that his scientific preoccupations led him 
away from the internal relations of effort and success to the external re- 
lationship of any satisfying state of affairs strengthening any connection 
which happened to be near it. 

Perhaps more heat has been generated over Thorndike's subordina- 
tion of insight and understanding to drill and habit than over any other 
aspect of his writings. While he thought insight very rare in animals — 
perhaps rarer than it actually is — he did not deny insight in man. He was 
not awed by it, and thought it best understood by the same associative 
laws applying in other situations. Just as erroneous inferences are made 
because of habitual associations which throw the learner off his course, 
so the insights of the genius are made by appropriate habitual associations 
and analogies. He had this to say of reaction to novel situations: 

There is no arbitrary hocus pocus whereby man's nature acts in an unpre- 
dictable spasm when he is confronted with a new situation. His habits do not 
then retire to some convenient distance while some new and mysterious entities 
direct his behavior. On the contrary, nowhere are the bonds acquired with old 
situations more surely revealed in action than when a new situation appears. 46 

Although this comment is true enough, Thorndike's failure to give 
real concern to the way in which past habits are utilized in problem solu- 
tion, to consider what arrangement makes a problem hard, what easy, 
when the same essential bonds are involved, is a genuine limitation. The 
difference is a real one for school practice. It is possible to teach num- 
ber combinations first (establish the “bonds”), then expect some glimmer 
of understanding later, or it is possible to achieve some understanding 
of what numbers are for, to comprehend the situation as a problem, and 
then to learn the combinations in this context. In the end you come out 
at the same place, knowing the tables and knowing how to use them, but 
it is not a foregone conclusion that the one method of teaching will be 
more efficient than the other, either for knowing precisely what has been 
taught or for being able to apply it in new situations. Thorndike's pre- 
occupation with bonds has insured that we turn to others, not Thorndike’s 
followers, for a more careful appraisal of the role of meaning and under- 
standing. 

48 Thorndike ( 1913b), page 29. 
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SUPPLEMENTARY READINGS 


BOOKS 

Thorndike was a prolific writer. His bibliography appears in two parts in 
the Teachers College Record: for the years 1898 to 1940 in volume 41 (1940), 
pages 699-725; for the years 1940 to 1949 in volume 51 (1949), pages 42-45. 
The total comes to more than 500 items. 

The following books contain his major contributions to learning theory, with 
much supporting experimental data: 

Thobndike, E. L. (1911) Animal intelligence. 

Thorndike, E. L. (1913) The psychology of learning ( Educational psychology , 
Vol.II). 

Thorndike, E. L. (1922) The psychology of arithmetic. 

Thorndike, E. L. and others (1928) Adult learning. 

Thorndike, E. L. (1932) The fundamentals of learning. 

Thorndike, E. L. ( 1935 ) The psychology of wants , interests , and attitudes . 
Thorndike, E. L. (1949) Selected writings from a connectionist’ s psychology . 

shorter introductions 

Gates, A. I. (1942) Connectionism: Present concepts and interpretations. Chap- 
ter 4 in The psychology of learning. Natl. Soc, Stud. Educ., 41st Yearbook, 
Part II, 141-164. 

Rock, R. T., Jr. (1940) Thorndike’s contributions to the psychology of learning. 
Teach. Coll. Rec., 41, 751-761. 

Sandiford, P. (1942) Connectionism: Its origins and major features. Chapter 3 
in The psychology of learning. Natl. Soc. Stud. Educ., 41st Yearbook, Part 
II, 97-140. 

Thorndike, E. L. (1940) The ABC of human behavior. Chapter 2 in Human na- 
ture and the social order , 5-20. 

critical reviews 

Brown, J. F., and Feder, D. D. (1934) Thorndike’s theory of learning as gestalt 
psychology. Psychol. Bull., 31, 426-437. 

Cason, H. (1932) Review of Thorndike’s Human learning. J. abnorm. soc. 
Psychol , 27, 214-222. 

Hull, C. L. (1935) Special review of Thorndike’s The fundamentals of learning. 
Psychol Bull. , 32, 807-823. 

McGeoch, J. A. (1933) Review of Thorndike’s The fundamentals of learning . 
/. gen. Psychol , 8, 285-296. 

Tolman, E. C. (1936) Connectionism; wants, interests, and attitudes. Character 
and Pers., 4, 245-253. 

representative experiments 

A scientific theory can best be judged by its consequences in experimenta- 
tion. This requires an appraisal of the kinds of experiments to which the theory 
leads and a review of the interpretations offered for the obtained data. A pre- 
ferred theory ought to lead to cogent experiments and to suggest convincing 
interpretations. 

Following this and succeeding chapters there will be short lists of experi- 
ments, useful in estimating the theories in this way. Many other references would 
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be equally satisfactory for this purpose. The items marked with an asterisk are 
reproduced in large part in Thorndike ( 1949) . 

Lorge, L (1936) Irrelevant rewards in animal learning. jf. comp. Psychol ., 21, 
105-128. 

Postman, L., and Jarrett, R. F. (1952) An experimental analysis of learning 
without awareness/ Amer. J. Psychol ., 65, 244-255. 

Rock, R. T., Jr. (1935) The influence upon learning of the quantitative variation 
of after-effects. Teach. Coll . Contr. Educ ., No. 650. 

Stone, G. R. (1953) The effect of negative inventives in serial learning: VII. 

Theory of punishment. J. gen. Psychol., 48, 133-161. 

Thorndike, E. L. (1908) The effect of practice in the case of a purely intel- 
lectual function. Amer. J. Psychol., 19, 374-384. 

Thorndike, E. L. (1910) Practice in the case of addition. Amer. J. Psychol., 
21, 483-486. 

Thorndike, E. L. (1924) Mental discipline in high school studies. J. educ. 
Psychol, 15, 1-22; 83-98. 

^Thorndike, E. L. (1927) The influence of primacy. J. exp. Psychol, 10, 18-29. 
^Thorndike, E. L. (1932) Reward and punishment in animal learning. Comp. 
Psychol. Monogr., 8, No. 39, 65 pp. 

^Thorndike, E. L. (1933) An experimental study of rewards. Teach. Coll 
Contr. Educ., No. 580, 72 pp. 

Thorndike, E. L., and Lorge, I. (1935) The influence of relevance and be- 
longing. J. exp. Psychol, 18, 574-584. 

Thorndike, E. L., and Rock, R. T., Jr. (1934) Learning without awareness of 
what is being learned or intent to learn it. ]. exp. Psychol, 17, 1-19. 

FINAL NOTE 

For notices of the first edition of this book, disagreeing with my interpreta- 
tion of some aspects of Thorndikes system, see: 

Stone, G. R. (1948) Hilgard on the dominant laws of learning. Psychol. Rev., 
55, 342-344. 

Woodworth, R. S. (1949) Review of Hilgard’s Theories of learning. J . abnorm. 
soc. Psychol, 44, 124-129. 

The following reviews bear importantly on* the law of effect as a central 
feature of Thorndike's system: 

Meehl, P. E. (1950) On the circularity of the law of effect. Psychol Bull, 47, 
52-75, 

Nuttin, J. (1953) Tdche, reussite et echec. Especially Chapter V, pages 247- 
304. 

Postman, L. ( 1947) The history and present status of the Law of Effect. Psychol. 
Bull, 44, 489-563. 



Chapter 3 

GUTHRIE’S CONTIGUOUS 
CONDITIONING 


I N some respects, Edwin R. Guthrie’s system is very much like Thorn- 
dike’s. It is a stimulus-response association psychology, objective and 
practical. But in other respects the systems are very different. It is such 
similarities and differences which define the problems of contemporary 
learning theory. 

Thorndike accepted two kinds of learning: selecting and connecting 
(under the law of effect), and associative shifting. For him, associative 
shifting was originally the fifth of some subsidiary principles, and by far 
the major burden was carried by selecting and connecting. For Guthrie, 
on the contrary, a conception very like associative shifting is made the 
cornerstone of his system, and learning of the trial-and-error sort is 
derivative and secondary. Guthrie does not accept the law of effect in 
Thorndike’s sense, and this is the basic cleavage between their systems. 

THE BACKGROUND OF BEHAVIORISM 
AND OF CONDITIONING 

Guthrie is a contemporary behaviorist who lays much stress upon 
conditioning as the characteristic form of learning. Before turning to his 
own theories it may prove helpful to consider the background against 
which they have emerged. 

Watson’s Behaviorism 

Behaviorism as a "school” of psychology is usually thought of as 
originating with Watson (1913), who became its most vigorous spokes- 
man. There were other varieties of behaviorism, however, and Guthrie 
was led to his position by way of Singer (1911), with whom he had 
studied. The behaviorists have in common the conviction that a science 

48 
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of psychology must be based upon a study of that which is overtly ob- 
servable: physical stimuli, the muscular movements and glandular secre- 
tions which they arouse, and the environmental products which ensue. 
The behaviorists differ somewhat among themselves as to what may be 
inferred in addition to what is measured, but they all exclude self- 
observation (introspection) as a legitimate scientific method. Partly as 
a protection against an indirect use of introspection, the behaviorists have 
tended to prefer experimentation on animals and infants. 

Watson's Behavior : An introduction to comparative psychology 
(1914) was the first book to follow the announcement of his new posi- 
tion. In it occurred his attempted refutation of Thorndike's law of effect 
and the substitution of the laws of frequency and recency. He believed 
that animal learning, as in the maze or problem-box, could be explained 
according to what the animal had most often been led to do in the situa- 
tion, with the most recent act favored in recall. Because the successful 
act was both most frequent and most recent, its recurrence could be ex- 
plained without recourse to an added principle of effect. This denial of 
effect was part of his program of getting rid of the residual subjectivity 
which he felt was implied in Thorndike's concepts of satisfiers and an- 
noyers. While the frequency-recency theory did not survive its criticisms, 1 
it serves to point up Watson’s desire to find objective laws to substitute 
for those with even a tinge of subjective flavor. 

The behaviorist knows that other events intervene between measured 
stimuli and the responses to them. In order to preserve a systematically 
coherent position, these intervening events are posited to be much like 
the observed ones, that is, implicit or covert stimulus-response sequences. 
In his early studies on the control of the maze habit, Watson ( 1907 ) had 
attributed great importance to kinesthetic stimuli as integrators of the 
habits involved. Because kinesthetic stimuli are aroused as a result of the 
organism's movements, they fit well into a behavioral or reaction psy- 
chology. Even the unobserved processes inferred to be going on between 
stimuli and responses are said to be comprised of movements and 
movement-produced stimuli. This emphasis upon kinesthesis as the 
integrator of animal learning served Watson well when he became puz- 
zled about human thought processes. He decided that thought was merely 
implicit speech, that is, talking to oneself. Sensitive enough instruments, 
he conjectured, would detect tongue movements or other movement 
accompaniments of thinking. He was thus able to hold to his consistent 
behaviorist position without denying that thinking goes on. 

1 Peterson (1922); Gengerelli (1928). 
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It was somewhat later that Watson discovered that the conditioned 
reflex of Pavlov and Bekhterev might serve as a useful paradigm for 
learning. 2 Because it grew out of the objective tradition that had hap- 
pened to develop within Russian physiology, it fitted his temper and 
he adopted it enthusiastically. In Watson’s later writings the condi- 
tioned reflex was central to learning, as the unit out of which habits are 
built. 

Watson’s general textbook, Psychology from the standpoint of a 
behavioristy appeared in 1919. It was soon followed by other books writ- 
ten from an avowedly behavioristic standpoint. Among these was Smith 
and Guthrie’s General psychology in terms of behavior (1921). Like 
Watson’s book, it treated all of psychology from a behavioral viewpoint 
and made use of conditioning principles. It, too, laid great stress upon 
movement-produced stimuli. Hence there is a family relationship between 
the two books, although they differ greatly in expository style. Watson 
'laid far more stress upon the details of physiology and anatomy, and 
upon appropriate methods for the behavioral study of psychological re- 
lationships. Smith and Guthrie showed less concern for experimental 
and neurophysiological detail, but instead gave a plausible interpretation 
of ordinary experience as described consistently from the new standpoint. 
Guthrie’s contemporary writings preserve the flavor of the Smith and 
Guthrie book. Despite their similarities, Guthrie’s point of view must be 
considered as something other than a working out of Watson’s position. 3 

The Conditioned Reflex or Conditioned Response 

The conditioned-reflex principle is conceptually so simple as to make 
an attractive prototype for learning. Certain stimuli lead regularly to re- 
flex responses. A light flashed into the eye causes pupillary constriction, 
a tap on the patellar tendon leads to a knee jerk, chemicals on the 
tongue produce salivation. Such natural reflexes are called unconditioned 
reflexes and the stimuli which produce them are called unconditioned 
stimuli . If a second stimulus, not originally leading to the response, is 
frequently presented slightly before or accompanying the unconditioned 
stimulus, it presently comes to elicit the response. Such a stimulus is 
called a conditioned stimulus and the learned response is called a con- 
ditioned reflex. Because of the desire to extend the principle to responses 
not obviously reflex, later writers have usually preferred the less specific 

2 Watson (1916). 

3 It is suggested in Chapter 5 that Hull's position is another kind of descendant of 
Watson's. 
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word "response” to the word "reflex,” so that the movements involved 
are called conditioned and unconditioned responses, whether or not they 
conform to the definition of a reflex. 

The study of conditioned salivary responses in dogs was carried out 
systematically by Pavlov 4 over many years, and he discovered most of 
the relationships which later studies have more fully explored. The trans- 
lations of his terms have become common in the literature of learning. 

The appearance or strengthening of a conditioned response is said 
to take place through reinforcement. Reinforcement may refer to the ex- 
perimental arrangement of following the conditioned stimulus by the un- 
conditioned stimulus, although the word is sometimes used to refer to 
the implied process of strengthening of the conditioned response. When 
the conditioned stimulus is presented alone, unaccompanied by the un- 
conditioned stimulus, the conditioned response is said to be non - 
reinforced. Under repeated non-reinforcement there is a tendency for 
the conditioned response to weaken or even to disappear completely. This 
is referred to as experimental extinction , or, more simply, extinction. By 
extinction may be meant either the experimental arrangement of non- 
reinforcement, or the weakening process itself. Extinction is not, in fact, 
a destruction mi the conditioned response, for when the animal is returned 
to the laboratory after a rest following extinction, the conditioned re- 
sponse is usually found to have reappeared, though commonly at reduced 
strength. This return of response strength after extinction, without inter- 
vening reinforcement, is called spontaneous recovery. The simple history 
of a conditioned response is then its strengthening through reinforce- 
ment, its weakening through extinction, and its partial recovery with 
rest. 

Two additional concepts are extremely important in the extension 
of conditioning principles. These are generalization and discrimination . 
A new conditioned stimulus, not previously reinforced, may elicit a con- 
ditioned response the first time it is presented. The probability that it 
will do so is increased if it is similar to the conditioned stimulus which 
has been reinforced. Thus if a conditioned stimulus is obtained to one 
tone, another tone, at a slightly different frequency, will also produce a 
conditioned response, with lesser magnitude the further the separation 
of the tones. This process whereby a novel stimulus produces a response 
learned to another similar stimulus is known as generalization . If the two 
stimuli are sufficiently distinguishable, the organism can be taught to re- 
spond to one of them and to cease responding to the other. Such a dis- 

^ Pavlov (1927) (1928). 



52 THEORIES OF LEARNING 

crimination is produced by the method of contrasts. That is, one of the 
stimuli is regularly reinforced, the other regularly non-reinforced. The 
selective extinction which results is known as conditioned discrimination , 
because the organism has learned to react differentially to the two stimuli 
which at first evoked more nearly the same response. Discrimination may 
be between patterns of stimuli as well as between single stimuli. 

Pavlov’s paradigm of conditioning has come to be called classical 
conditioning to distinguish *it from a variety known as instrumental con- 
ditioning. Instrumental conditioning is epitomized by an experiment to 
be described more fully in the next chapter: lever-pressing by the rat, 
rewarded by a pellet of food. Because pressing the lever is causally re- 
lated to the appearance of food the learned response is an instrumental 
one, that is, it acts upon the environment to produce a given change. The 
two chief differences between classical conditioning and instrumental 
conditioning lie in the relationships between the conditioned response 
and the reinforcing event: 

1. Classes of response. In classical conditioning, the conditioned 
and unconditional responses fall in the same class. That is, conditioned 
salivation is reinforced by presenting an unconditioned stimulus that 
elicits unconditioned salivation. In instrumental conditioning, the two 
responses fall into different classes: conditioned lever-pressing is rein- 
forced by food, and unconditioned eating responses have little in com- 
mon with lever-pressing. 

2. Contingent relationship between conditioned and unconditioned 
stimuli. In classical conditioning, reinforcement is under the control of 
the experimenter. The unconditioned stimulus (e.g., food) appears after 
a standard time interval whether or not the dog responds to the condi- 
tioned stimulus (e.g., bell). In instrumental conditioning the reinforce- 
ment is controlled by the behaving organism. That is, the rat receives 
the reinforcing stimulus (food) only if it gives the conditioned response 
of pressing the lever. 

Despite these differences in experimental arrangements, the general 
laws of reinforcement, extinction, spontaneous recovery, generalization, 
and discrimination are so similar within classical and instrumental con- 
ditioning that a common terminology has come to be used for both 
varieties, 5 and it is a matter for theoretical discussion whether or not 
separate principles of learning apply to the two classes. 

In choosing to use what has been learned from conditioned-response 

5 A glossary of terms used in conditioning experiments may be found in Hilgard and 
Marquis (1940), pages 341-353. 
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experiments, Watson and Guthrie selected somewhat different alterna- 
tives. Watson used the Pavlov experiment as the paradigm of learning, 
made of the conditioned reflex the unit of habit, and built his system on 
that foundation. Guthrie, unlike Watson, starts with a principle of con- 
ditioning or of associative learning, a principle which is not dependent 
'strictly upon the Pavlov kind of experiment To Guthrie, Pavlov's ex- 
periment is a highly artificial one, useful for the relationships it exhibits, 
but itself requiring explanation according. to more general principles. 
For Watson, the classical conditioning experiment provided the answers 
he sought as to how learning takes place; for Guthrie, the classical con- 
ditioning experiment raises as many questions as it answers. 


CONTIGUITY OF CUE AND RESPONSE: THE ONE 
LAW OF ASSOCIATION 

The Lowest Common Denominator of Learning 

Guthrie's one law of learning, from which all else about learning is 
made comprehensible, is stated by Guthrie as follows: “A combination 
of stimuli which has accompanied a movement will on its recurrence tend 
to be followed by that movement” 6 

There is an elegant simplicity about the statement, which avoids 
mention of drives, of successive repetitions, of rewards or of punishments. 
Stimuli and movements in combination: that is all. This one principle 
serves as the basis for a very ingenious and intriguing theory of learning. 

A second statement is needed to complete the basic postulate about 
learning: “A stimulus pattern gains its full associative strength on the 
occasion of its first pairing with a response.” 7 

This somewhat paradoxical statement, in view of undeniable im- 
provement with practice, is a very necessary adjunct to the theory, be- 
cause it makes possible a number of derivative statements about learn- 
ing and forgetting. It can be thought of as a kind of recency principle , 
for if learning occurs completely in one trial, that which was last done 
in the presence of a stimulus combination will be that which will be done 
when the stimulus combination next recurs. 8 

6 Guthrie (1935), page 26; Guthrie (1952), page 23. Where the quotations from 
Guthrie remain unchanged between the 1935 and the 1952 editions of his book, the 
earlier only will be cited hereafter. 

7 Guthrie (1942), page 30. 

8 Because it refers to the last response of a succession rather than to recency in time, 
Voeks ( 1948 ) ( 1950 ) suggests that it be called the principle of postremity. 
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How can Guthrie demonstrate that more complicated forms of learn- 
ing conform to these simple principles? As in the case of other sophisti- 
cated theorists, he does not proceed by denying familiar forms of learn- 
ing. His problem is that of showing that learning as we know it can, in 
fact, be shown not to contradict these basic principles. He does not deny 
that there is learning which may be described as insightful or purposive 
or problem-solving. It is Guthrie's task to show that each of these forms 
requires no new principles of explanation beyond the primary law of as- 
sociation by contiguity. 

Why Strict Contiguity of Measured Stimulus and 
Response Is Not Essential 

One of the standard experiments in the literature of conditioning is 
that showing the importance of the time interval between the conditioned 
stimulus and unconditioned response. The empirical results suggest a 
gradient, with a most favorable interval and less favorable intervals on 
either side of this optimal interval. 9 

Guthrie is able to hold out for strict simultaneity of cue and response 
in the face of these data by proposing that the true cue being conditioned 
is not the stimulus as measured. An external stimulus may give rise to 
movements of the organism. These movements in turn produce stimuli. 
When associations appear to be made between stimuli and responses 
separated in time, it is because these intervening movements fill in the 
gap. The true association is between simultaneous events. 

There is a strong preference for movement-produced stimidi as the 
true conditioners in Guthrie's system. They permit the integration of 
habits within a wide range of environmental change in stimulation, be- 
cause these stimuli are carried around by the organism. It appears that 
some of this preference dates from the early emphasis of Watson (1907) 
on kinesthesis as the basis of control of the maze habit, a position no 
longer tenable. 10 Such covert movement-produced stimuli provide ever- 
present explanations for conduct which cannot be inferred from external 
stimulus-response relationships. 

9 Hilgard and Marquis (1940), pages 158-173. Additional determinations of this 
stimulus-separation gradient include Reynolds (1945a), Kimble (1947), White and 
Schlosberg (1952), McAllister (1953), For somewhat discordant results, see also 
Libby (1951). 

10 Honzik (1936) found kinesthesis to be one of the least useful of several sensory 
controls of the maze habit. 
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Why Repetition Brings Improvement 

The reason that practice brings improvement is that improvement, 
and other forms of success, refer to acts, to outcomes of learnings, rather 
than to movements. Guthrie believes that his interest in movements, and 
the prediction of movements, is almost unique among learning theorists; 
others, he says, are interested in goal achievements, and results of one 
sort or another. One difference between him and Thorndike is that 
Thorndike is concerned with scores on tasks, with items learned, pages 
typed, or correct responses attained. Guthrie is concerned only with 
the movements of the organism, regardless of whether they lead to 
error or success. 

A skill, such as getting the ball into the basket in a game of basket- 
ball, is not one act but many. It does not depend upon a single muscular 
movement, but upon a number of movements made under a number of 
different circumstances. Any one movement may be learned in any one 
trial, but to learn all the movements demanded by the complicated skill 
calls for practice in all the different situations: while near the basket 
and far away, on one side and on the other, with and without a guard 
nearby. Practice is necessary; but it produces its consequences, not in 
accordance with a law of frequency, but according to the simple princi- 
ple of the attachment of cues to movements. The more varied the move- 
ments' called for in a given act of skill, and the more varied the cues 
which must become assimilated to these movements, the more practice is 
required. There is no mystery about the length of time it takes to learn 
to operate a typewriter: there are so many keys in so many combina- 
tions, calling for the attachment of a great many cues to a great many 
responses. It is concomitantly necessary to get rid of the faulty associa- 
tions which lead to what, from a product point of view, is an error. This 
is done by having the correct behavior occur to the cue which previously 
gave rise to the faulty behavior. When finally all the cues lead to ac- 
ceptable behavior, the task is mastered. The apparent contradiction of 
single-trial learning with actual experience of painstaking fumbling 
before success is achieved is resolved when the skilled task is seen to 
be composed of a large number of habits. 

Associative Inhibition, Forgetting, and the Breaking 
of Habits 

The fact of extinction is one of the findim*s of conditioning experi- 
ments that is in need of explanation. Because elms should remain faith- 
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ful to their responses, Guthrie cannot agree to extinction as a decay due 
to mere non-reinforced repetition. According to him, extinction always 
occurs as associative inhibition, that is, through the learning of an 
incompatible response. His is an interference theory and hence requires 
no new principles, because the original learning and the interfering 
learning follow the same rules. 

Forgetting is explained in the same way. If there were no inter- 
ference with old learning there would be no forgetting. It has been shown, 
for example, that conditioned responses, even though in some respects 
they appear fragile, are actually quite resistant to forgetting. 11 The long- 
lasting character of these conditioned responses is to be understood be- 
cause they represent learning highly specific to cues not confronted in 
daily life. If the learners lived in the laboratory, their responses would be 
subject to more interferences. Guthrie's position is but an extreme form 
of the retroactive inhibition theory of forgetting, to be discussed in 
greater detail later. 

If it is desired to break up a habit (that is, to accelerate its forget- 
ting), it is only necessary to cause other movements to occur in the 
presence of the cues to the habit. The problem of locating the cues and 
substituting other behavior often takes time, because many cues may 
lead to the undesirable habit. 

Drinking or smoking after years of practice are action systems which can 
be started by thousands of reminders. ... I had once a caller to whom I was 
explaining that the apple I had just finished was a splendid device for avoiding 
a smoke. The caller pointed out that I was at that moment smoking. The habit of 
lighting a cigarette was so attached to the finish of eating that smoking had 
started automatically. 12 

Guthrie suggests three ways in which activities are commonly weak- 
ened: 13 

1. The first method is to introduce the stimulus which you wish to 
have disregarded, but only in such faint degree that it will not call out 
its response. This is the method of training a horse to the saddle by 
starting with a light blanket, and gradually working up to full equip- 
ment, at no time permitting the horse to become so startled that it plunges 
or struggles. 

2. The second method is to repeat the signal until the original re- 

11 E.g., Hilgard and Campbell ( 1936); Wendt ( 1937); Skinner (1950). 

12 Guthrie (1935), page 139. 

13 Paraphrased from Guthri^ 1935), pages 70-73. 
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sponse is fatigued, and then continuing it, so that new responses are 
learned to the signal. The “broncho-busting” of the western ranches fol- 
lowed essentially this technique, 

3. The third method is to present the stimulus when other fea- 
tures in the situation inhibit the undesirable response. One illustration 
given by Guthrie is that of training a dog not to catch and eat chickens 
by tying a dead chicken about its neck. As it struggles to get rid of the 
corpse it develops an avoidance response to chickens at close quarters. 
Another example, illustrating undesirable learning, is the disobedience 
learned by the child whose mother calls him when he is too occupied with 
what he is doing to obey. 

SOME DERIVATIVE EXPLANATIONS AND 
APPLICATION S 

In his two books The 'psychology of learning (1935) (1952) and 
The psychology of human conflict (1938), Guthrie finds it unnecessary 
to make any formal additions to the basic principles of learning in order 
to apply his theory to practical learning problems and to the handling 
of personality disorders. 14 While the basic principles remain, there are 
some ingeniously derived supplementations which, once accounted for, 
play an important role in the further discussions. Among these are ex- 
planations of the place of motives, the action of reward and punishment, 
and the origin of anticipatory responses as important substitutes for 
ideas and intentions. 

Motives 

The motivational state of the organism, its hunger, thirst, or state 
of comfort or discomfort, has no formal place in Guthrie's learning 
theory; the motivational state is important only because it determines 
the presence and vigor of movements that may enter into associative 
connection. The motive is important only for what it causes the organism 
to do. The movements that occur get associated; if a hungry cat acts 
differently from a well-fed cat, her movements are different and so her 
learning may be different. She learns what she does; what she does is 
more important than what her motivational state happens to be. In the 

14 The same comments apply to his collaborative books as well: Smith and Guthrie 
(1921), Guthrie and Horton ( 1946), Guthrie and Edwards (1949), Guthrie and 
Powers (1950). 
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Guthrie and Horton (1946) experiments the cat often did -not eat the 
salmon provided in the dish outside the cage. This did not matter, for 
leaving the cage was the important behavior. 

Motives are, however, important in one of the derivations from the 
basic theory of contiguous association: they are important in providing 
maintaining stimuli , keeping the organism active until a goal is reached. 
The goal removes these maintaining stimuli, and brings the activity to 
an end. 15 We shall see how these maintaining stimuli, along with move- 
ment-produced stimuli, tend to keep a series of acts integrated, and ac- 
count for anticipation, for behavior characterized by intent. 

Reward 

While Guthrie believes as everyone else does that rewards influence 
outcomes, his rejection of the law of effect and of the principle of rein- 
forcement in conditioning is based on the position that nothing new 
is added to associative learning by reward except a mechanical arrange- 
ment. This mechanical arrangement, which places reward at the end 
of a series of acts, removes the organism from the stimuli acting just 
prior to the reward. Hence, being removed from the stimuli, the behavior 
to these stimuli is preserved intact. Instead of behavior being strength- 
ened by reward, reward preserves it from disintegration. It was just as 
strong before the reward occurred, but, if there had been no reward, 
behavior in the same situation would have been changed. The act lead- 
ing to the reward, being the last act in the problematic situation, is the 
one favored when the situation next repeats itself. Guthrie is very explicit 
about this. Of an animal's excape from a problem-box he says: 

The position taken in this paper is that the animal learns to escape with its 
first escape . This learning is protected from forgetting because the escape re- 
moves the animal from the situation which has then no chance to acquire new 
associations. 

[Of latch-opening followed by food.] What encountering the food does is 
not to intensify a previous item of behavior but to protect that item from being 
unlearned. The whole situation and action of the animal is so changed by the 
food that the pre-food situation is shielded from new associations. These new 
associations can not be established in the absence of the box interior, and in 
the absence of the behavior that preceded latch-opening. 16 

Although this is the fundamental position with respect to reward, 
and frequently reiterated in refutation of the law of effect and related 
interpretations, the action of reward is found to be somewhat more com- 

15 Guthrie ( 1942), page 18^ 

16 Guthrie (1940), pages 144-145. Italics in the original. 
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plicated when one examines the totality of Guthries system. The first 
(and primary) role of reward is to remove the animal from the prob- 
lem and thus prevent unlearning. But by the principle of association the 
animal also learns the activity that he carries on in the presence of the 
reward (chewing and salivating to food, for example), and this behavior 
tends to be invoked by renewed hunger and by any of the cues from 
the problem that may have persisted while the rewarded behavior was 
going on. 

. . . There is one act, however, to which hunger may remain a faithful 
conditioner. That is the act of eating; and the faithfulness of hunger to this 
association derives from the fact that hunger dies when eating occurs. As Steven- 
son Smith and I pointed out in our General Psychology , elements of the consum- 
matory response tend to be present throughout a series of actions driven by a 
maintaining stimulus. 17 

Not only do general movements of eating tend to be aroused by hunger 
contractions, but the specific movements demanded by the peculiar nature of 
the food are possibly in evidence. Hence when the rat runs the maze he is ready 
for whatever reward has been received in the past, sunflower seed or bran mash. 
This readiness is an actual muscular readiness. . . . 1S 

A theoretical problem arises when Guthrie holds at once (a) that 
the reward removes the learner from all the cues prior to the reward, 
and ( b ) that these prior cues are somehow reward-attached. He could 
resolve this dilemma in one of two ways. He could retreat from the all- 
or-nothing removal of cues by reward: perhaps reward acts by changing 
the situation somewhat, even though not entirely. Or he could define 
the "true” reward as coming at a stage later than eating, perhaps consum- 
mation through food digestion. The animal is still hungry while eating, 
hence hunger produces anticipatory eating responses. Then, however, 
the act of eating does not remove the animal from the cues to which he 
has been exposed, because he has been exposed to eating-response cues 
all along. What finally changes the situation and brings the end of hunger 
(and presumably other cues) is satiation. Although this sounds like 
something of a quibble, the distinction between the theory of contigu- 
ous association and the theory of reinforcement is so slight that only the 
utmost clarity of statement will permit a choice between them. 

One illustration may serve to show how important it is to know 
exactly what the theory states. The experiment of Sheffield and Roby 
(1950) shows that the non-nutritive taste of saccharin may serve as a 
reward. The result is used as a refutation of the need-satiation theory 
of reinforcement and as support for the theory of contiguous associa- 

17 Guthrie (1935), pages 151-152. 

18 Guthrie (1935), page 173. 
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tion (as well as some other theories). The empirical support for Guthrie 
derives from the fact that the rat’s behavior changes strikingly after the 
saccharin is ingested. The authors conclude: 

It is suggested that elicitation of the consummatory response appears to be 
a more critical 'primary reinforcing factor in instrumental learning than the drive 
reduction subsequently achieved. 19 

While this consummatory response is the final one in a series, it 
has not yet taken the learner out of the situation. It must itself be part 
of the series, otherwise there could be no anticipatory responses reflect- 
ing it. What really takes the animal out of the situation? What is really 
reinforcing? Is it the new behavior that takes place after the consumma- 
tory response is over? But if it is this new behavior, does that not occur 
after some sort of stimulus-reduction has taken place? 

Guthrie’s general standpoint is clear enough: reward does not 
strengthen prior behavior, it merely protects it from unlearning. But the 
more precise questions about the relation of this position to his inter- 
pretation of anticipatory response remain unanswered. 

Punishment 

The primary interpretation of punishment is that of all associative 
learning: you tend to do what you did under the same circumstances: 

. . . Sitting on tacks does not d^ courage learning. It encourages one in 
learning to do something else than sit. Jt is no' l he feeling caused by punishment, 
but the specific action caused by punishment that determines what will be 
learned. To train a dog to jump through a hoop, the effectiveness of punishment 
depends on where it is applied, front or rear. It is what the punishment makes the 
dog do that counts or what it makes a man do, not what it makes him feel. 20 

What we can predict is that the influence of stimuli acting at the time of 
either satisfaction or annoyance will be to re-establish whatever behavior was in 
evidence at the time. 21 

If one were to stop with statements such as these, it would appear 
that Guthrie treats reward and punishment in a symmetrical fashion. 
Certainly punishment changes a situation very strikingly, as reward does. 
Hence one might infer that all antecedent behavior would remain intact, 
being protected from new learning by the altered conditions of punish- 
ment as much as by the altered conditions of reward. Sometimes this 
does actually appear to be the case. Running responses leading to charged 
grills, with safety beyond, may prove resistant to extinction. 22 

19 Sheffield and Roby ( 1950 ), page 481. 

20 Guthrie ( 1935), page 158. 

21 Guthrie (1935), page 154. 

22 Gwinn (1949), F. D. Sheffield (1949a). 
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The symmetry in treatment of reward and punishment, as implied 
in the foregoing quotations, is somewhat illusory, for “doing what you 
last did” refers to very different parts of the behavior cycle when the 
reference is to reward than when the reference is to punishment. “What 
you last did” that remains in your behavior repertory because of reward 
is what you did just before the reward appeared; “what you last did” in 
the case of punishment refers to what you did just after the punishment, 
leading to escape. Punishment produces “maintaining stimuli” relieved 
by a later movement that brings relief: 

, . . An animal on a charged grid, a barefoot boy on a hot pavement, a man 
sitting on a tack have as their goals mere escape from the intense stimulation 
that causes general tension and restlessness as well as specific movement. These 
stimuli continue to act as what Stevenson Smith and I called maintaining stimuli 
until some movement carries the subject away from the source of stimulation, or 
the source of stimulation away from the subject. 23 

When these stimuli are removed, we have the circumstances defin- 
ing reward, and so, if that were the whole story, the prior behavior should 
remain intact. This kind of relief from punishment has come to be known 
as escape learning to distinguish between it and avoidance learning. 
To move from escape learning to avoidance learning we require an antici- 
patory response, conditioned to some cue, so that the punishment is 
circumvented. As we saw in relation to reward, Guthrie's system makes 
provision for anticipatory responses, and these can, of course, be made 
use of to explain avoidance learning. The animal merely makes the 
escape response to some cue present at the time of punishment — a cue 
which, fortunately, makes its appearance before the threatened punish- 
ment. Hence what happens at the end of a sequence of acts leading to 
punishment does something in addition to and other than removing the 
organism from the scene; it also sets up some conditioned anticipatory 
responses. 

Noxious stimuli may lead not only to escape and avoidance learn- 
ing but to a third subvariety, more properly called punishment, that is, 
to the inhibiting of otherwise rewarded behavior. We may try to stop 
the child's doing something he enjoys; we may try to break a “bad 
habit” Punishment of this kind always involves conflict. 

Guthrie's position with respect to this subvariety of punishment has 
been aptly summarized by Sheffield: 

(a) Punishment works only if the last response to the punished situation 
is incompatible with the response that brought on the punishment 

25 Guthrie { 1935), page 165. 
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( b ) Punishment works only if the cues present when the incompatible re- 
sponse is performed are present when the punished response is performed. 

(c) Punishment that produces only emotional excitement will tend to 
fixate the punished response. 24 

The main point is that punishment wins out in a conflict with be- 
havior that is otherwise motivated only if there is conflict. The state- 
ment about the presence of cues is not as decisive as it sounds, in view 
of Guthrie’s acceptance of anticipatory responses and movement-pro- 
duced stimuli as cues. In other words, there are always plenty of cues, 
if one needs them. The third point is a useful one: In some cases, what 
appears to be punishing to the trainer may not be punishing to the learner; 
in other cases, punishment, even though annoying, may accentuate 
stereotyped behavior. 

Intentions 

Conduct is organized into sequences in which people make plans 
and carry them out, or at least start to carry them out. Guthrie is aware 
of this and devotes a chapter to learning with and without intention. 25 

He and Smith had earlier followed the lead of Sherrington and 
Woodworth in considering sequences of behavior as composed of pre- 
current or preparatory responses and consummatory responses. 26 Such 
acts appear from the outside to be intentional, for the earlier adjustments 
clearly are in readiness for the consequences th£|t are to follow. These 
anticipatory responses or readiness reactions are said to be conditioned 
to maintaining stimuli. 

The typical case is that of the hungry rat running down an alleyway 
to food at the end. The activity is maintained by the internal stimuli 
aroused by food deprivation to which running and eating behavior have 
been conditioned in the past. That is, the rat found food at some previous 
time after running while hungry. These internal stimuli, plus the stimuli 
from the runway (if it has been previously a path to food), maintain 
the running of the animal against competing responses, such as stopping 
to explore. Anticipatory salivation or chewing movements give directional 
character to the behavior. All this food anticipation is fulfilled if there 
is food at the end of the maze. Because the stimuli of hunger and antici- 
pation are now removed, and the animal is out of the maze, all the 
learning is intact for a new trial at a later time. This is about as complex 
a description as Guthrie ever indulges in, and the details are not taken 

24 F. D. Sheffield (1949). 

25 Guthrie ( 1935), pages 202-211. 

26 Smith and Guthrie ( 1921 ); Sherrington ( 1906); Woodworth ( 1918), 
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as seriously as Hull, for example, takes them. But the paradigm pro- 
vides a way of talking about human intentions and purposes also. 

The essence of an intention is a body of maintaining stimuli which may or 
may not include sources of unrest like thirst or hunger but always includes ac- 
tion tendencies conditioned during a past experience — a readiness to speak, a 
readiness to go, a readiness to read, and in each case a readiness not only for the 
act but also for the previously rehearsed consequences of the act. These readi- 
nesses are not complete acts but they consist in tensions of the muscles that will 
take part in the complete act. 27 

This statement goes a long way toward the point of view which those 
with very different theories of learning accept. The only feature which 
keeps it within the bounds of Guthrie's theory is that all the readinesses, 
including the readiness for the “previously rehearsed consequences of 
the act,” are said to consist in tensions in the muscles. This assumption, 
characteristic of the behaviorist position, remains in the realm of con- 
jecture rather than of demonstration. 

The Control of the Learning Process 

It is part of the charm of Guthrie's writing that it is closely in touch 
with life and provides amusing but cogent suggestions for meeting the 
problems of animal training, child-rearing, and pedagogy. This practical- 
ity is not a necessary characteristic of the system, for if one seriously 
attempted to provide evidence for the theory he would be buried in 
the midst of the precise movement correlates of measurable stimuli and 
the muscular tension accompaniments of preparatory adjustments. But 
the system is not intended to be taken seriously in that sense. As long 
as a convenient way of talking about things can be found without seem- 
ing to contradict the system, quantitative- precision is not essential. It is 
Guthrie's conviction that scientific laws to be useful must be approxi- 
mately true, but they must also be stated coarsely enough to be teach- 
able to freshmen. 28 

Most of the practical advice which Guthrie gives is good advice, 
and he succeeds in making it flow from the theory. Consider the follow- 
ing example: 

The mother of a ten-year-old girl complained to a psychologist that for two 
years her daughter had annoyed her by a habit of tossing coat and hat on the 
floor as $he entered the house. On a hundred occasions the mother had insisted 
that the girl pick up the clothing and hang it in its place. These wild ways were 
changed only after the mother, on advice, began to insist not that the girl pick 
up the fallen garments from the floor, but that she put them on, return to the 

27 Guthrie (1935), pages 205-206. 

28 Guthrie (1936a). 
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street, and re-enter the house, this time removing the coat and hanging it 
properly. 29 

Why was this advice given? Behavior is in response to stimuli. 
Hanging up the coat and hat was in response to her mothers pleading 
and the sight of the clothing on the floor. In order to attach the desired 
behavior- to its proper cues, it was necessary to go outside and come into 
the house, so that entering the house became the cue for hanging up the 
coat and hat. 

The following statements represent the kind of suggestions which 
recur in his writings: 

1. If you wish to encourage a particular kind of behavior or dis- 
courage another, discover the cues leading to the behavior in question. 
In the one case, arrange the situation so that the desired behavior occurs 
when those cues are present; in the other case, arrange it so that the 
undesired behavior does not occur in the presence of the cues. This is 
all that is involved in the skillful use of reward and punishment. A stu- 
dent does not learn what was in a lecture or a book. He learns only what 
the lecture or book caused him to do. 80 

2. Use as many stimulus supports for desired behavior as possible, 
because any ordinary behavior is a complex of movements to a complex 
of stimuli. The more stimuli there are associated with the desired be- 
havior, the less likely that distracting stimuli and competing behavior 
will upset the desirable behavior. There would be fewer lines confused 
in amateur theatricals if there were more dress rehearsals, since the cues 
from the stage and the actors are part of the situation to which the actor 
responds. Another way of putting this is to rule that we should practice 
in the precise form later to be demanded of us. 

EXPERIMENTS ON THE PUZZLE-BOX 

One of the serious lacks in the early history of Guthrie's proposals 
was the failure to set convincing experiments. Such an experiment 31 was 
completed between the two editions of his book on learning and provides 
a much more tangible ground on which to come to grips with both the 
strengths and the weaknesses of Guthrie's position. 

It is fitting that Guthrie and Horton should have chosen as a char- 
acteristic experiment the behavior of the cat in escaping from a pu zzl e- 

sa Guthrie ( 1935), page 21. 

30 Guthrie ( 1942), page 55. 

31 Guthrie and Horton ( 1946). 
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box, because this situation had already been the occasion for both ex- 
periment and theory. Thorndike’s classical experiment has already been 
cited (page 16). This gave the send-off to Thorndike’s theory by con- 
vincing him that little of the cat’s behavior was mediated by ideas and 
much of it controlled by the influence of rewards. Adams’ (1929) repe- 
tition of Thorndike’s work gave much more evidence for ideas. These 
two experiments represent extremes of theory in addition to providing 
detailed descriptions of behavior as background for the Guthrie and 
Horton experiment. 

Adams 9 Repetition and Supplementation of Thorndike's 

Work 

Adams (1929) repeated Thorndike’s experiment as faithfully as 
possible, noting details of the cat’s behavior instead of relying solely on 
time curves. He also designed some other experiments, of which the 
following is typical. 

A piece of liver is suspended from the top of a wire cage, so that the 
liver rests on the floor inside the cage, loosely held by the thread. A 
hungry cat in the room with the cage, but outside it, sees the liver and 
walks over to the cage. It hesitates for a time and its head moves up 
and down as though it is studying the string. Then it jumps on top of 
the cage, catches the string in its mouth, raises the liver by joint use of 
mouth and paw, and leaps down with the stick at the end of the string 
in its mouth. The liver is pulled to the top of the cage, but it is unfortu- 
nately torn off. In another trial the cat is successful in obtaining the 
liver by pawing the stick, taking it in its mouth and jumping down again. 
This time the liver has been more securely fastened, and the response is 
successful. 

Adams argues that this kind of behavior is very different from the 
random slashing about described by Thorndike and appears by analogy 
to be controlled by ideas. If the cat has used its past behavior and habits, 
it has used them in a new situation in a remarkably appropriate manner. 

The Guthrie and Horton Experiment 

Because of their wish to record details of movement rather than 
to score achievement in some other manner, a special problem-box was 
designed which permitted tire cat to be fully observed during the period 
prior to solution and its exact posture to be recorded photographically at 
the moment it activated the release. The release mechanism was a small 
pole set in the midst of the floor of a glass-covered cage with exit door 
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in the front The animal entered through a starting box and tunnel at 
the rear. If it touched the pole in any manner at all, the front door 
was opened and the animal could escape into the room. A camera was 
operated as the door was opening, so that a photograph of the animal was 
obtained at the moment of release, while it was still in contact with the 
pole. 

In each of three preliminary trials the animal entered the box 
through the tunnel and made its way out the front door, which was left 
ajar, to find a bit of salmon on the table top in front of the box. The 
first of the regular trials followed. During the regular trials, the experi- 
menters kept notes of the animal’s behavior as it entered the box through 
the tunnel at the rear, and the camera recorded the exact time and po- 
sition when it struck the release mechanism. 

The results are remarkable for the amount of repetitiousness in each 
cats successive release. A cat which bites the pole may do so time after 
time; one which has escaped by backing into the pole may back almost 
endlessly in its efforts to escape by the same movement. Others use front 
paws or hind paws, or roll against the pole. The cat, in full agreement 
with the theory, learns the method of escape in the first trial and then 
repeats what is essentially the same solution time after time. Some cats 
have several modes of escape which they use at different times, or 
they have one type of escape for a long time and then shift to another. 
These exceptions to the principle of doing what was done the last time 
are accounted for on the basis of a different entrance, which changes the 
stimulating conditions; the result of accidental distractions; or, having 
been in the box a long time and failing to operate the release by a familiar 
method, some new method may have superseded the familiar one. The 
fact that the last movement — the movement at the time of release — is the 
most stereotyped is in agreement with the principle that such an act will 
remain intact because nothing can interfere with it, the cat leaving the 
situation as soon as he strikes the release mechanism. The fact that the 
food reward is inconsequential is shown by the cats often failing to eat 
the fish or to lick the milk provided for it. 

Guthrie and Horton say that they have seen in the behavior of their 
cats all that Adams and Thorndike report. But they have also seen a 
degree of stereotypy which points strongly to the tendency for behavior 
to repeat itself under similar conditions. Some of the tracings of photo- 
graphs of Guthrie and Horton’s cats are shown in Figure 6. 

This behavior is so convincing that it has to be acknowledged. It is 
coherent with all that Guthrie has been proposing in his theory. That is 
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surely as much as could be wished for by a theorist from a series of 
experiments. 

Why did these cats learn so much more easily than Thorndike’s? 
The answer given is plausible. These cats always found the release mech- 
anism available in exactly the same form, and readily operable. Thorn- 
dike’s release mechanism was more difficult to operate and was probably 
not always in precisely the same position. Hence Thorndike’s cats had to 



FIGURE 6. Alternative stereotyped responses of a single cat escaping from the 
puzzle box by touching the pole. Responses K-9 through K-12 illustrate one type of 
response used frequently by cat K. The remaining pairs of pictures show four other 
response types used by the same cat. Reproduced from tracings of photographs taken 
automatically at the time of release. From Guthrie and Horton ( 1946). 

learn a series of habits rather than a single habit. When Guthrie and 
Horton’s cats failed to operate the mechanism by a familiar method 
they, too, adopted a new method. Stereotypy was shown because stereo- 
typy worked. 

Guthrie and Horton are clear that they are not proposing a test of 
the cat’s intelligence. They could easily have devised an experiment in 
which there would have been much less stereotypy. But the point is 
that as far as the learning of movements is concerned, the animal tends 
to do what it last did in the situation. If the situation forces it to do some- 
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thing else, it will do something else. Cats do not jump at the place where 
a bird was previously caught if there is no bird there, though they may 
lie in wait at the same spot. 

The experiments are accepted by the authors as fully justifying the 
theory which Guthrie has all along expounded: 

It has been our conclusion from our observation of this series of experiments 
that the prediction of what any animal would do at any moment is most securely 
based on a record of what the animal was observed to do in that situation at its 
last occurrence. This is obviously prediction in terms of association. 32 

Some Objections to the Guthrie and Horton Experiments 

as Representative of Animal Learning 

The experiments of Guthrie and Horton are appropriate ones for 
showing that, under limiting conditions, learned responses may show a 
high degree of stereotypy. To go beyond this and assume that these ex- 
periments provide the typical case for animal learning is misleading. 

According to the authors themselves, varied behavior supervenes if 
the response of the animal does not release it from the box fairly quickly. 
The stereotypy appears to result in part at least because the problem is 
an easy one. It may be mastered in a single trial, and the later trials 
are then merely the repeated performances of a learned act. There is 
little remarkable about easy learning taking place promptly, or about its 
being repeated when there is nothing to block it and reward to sustain 
it. 33 

Some critics have argued that the restrictions on the photographs 
violate Guthrie’s insistence that learned behavior should not be defined 
by outcome or effect, because the moment at which the picture was 
snapped was defined by the consequences of the activity: releasing the 
door. 

The effect of this restriction in picture taking could be seen more clearly in 
an experiment that required the animal to stand on his hind legs and reach out 
of the upper corner of the cage in order to press a button to release the door and 
take the picture. In this way we further restrict the set of possible responses and 
achieve added stereotypy. On the other hand, other experimental situations could 
be constructed, which would reduce stereotypy. In either case, however, what is 
procured are data concerning responses defined in terms of effect. 34 

It is not quite clear how much Guthrie and Horton wish to make of 
the amount of stereotypy they observed. Commenting on an earlier and 

82 Guthrie and Horton (1946), page 42. 

88 Baffling situations also lead to stereotypy. E.g., Hamilton (1916). 

84 Mueller and Schoenfeld ( 1954), page 358. 
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related experiment done with guinea pigs by Muenzinger, Koerner, and 
Irey (1929), Guthrie writes: 

These authors conclude that the mechanization of habitual movement “is 
still accompanied by variability of its pattern . . .” and that accessory move- 
ments, “while on the whole exhibiting much plasticity, show some mechaniza- 
tion of a brief and unstable kind.” 

The account given by Muenzinger, Koerner, and Irey is quite consistent 
with what Horton and I observed in our cats. We were interested in the routiniz- 
ing, they in the exceptions and new behavior. We were convinced that whole 
segments of movement appear and are repeated all-or-none on succeeding 
trials. 85 

Guthrie and Horton certainly succeeded in showing a considerable 
amount of repetitiveness and stereotypy in learned behavior. This leads 
them to conclude, as indicated in the earlier quotation, that prediction is 
best made on the basis of the last prior occurrence, and that such a pre- 
diction is obviously one in terms of association. We may question this 
last conclusion. To prophesy that a man who owns and wears only blue 
ties will wear a blue tie tomorrow is an actuarial prediction pure and 
simple, with no theory in it. It is logically similar to the prediction of an 
insurance company that a given percentage of people will die at a given 
age. The only assumption is a certain uniformity of events when taken 
en masse — an assumption scarcely attributable to associationism. 

To account for learning, rather than for the repetition of learned acts, 
one has to account not for uniformity but for change. Upon learning in 
this sense the Guthrie and Horton experiments throw little light. Most 
of the change in behavior took place in the neglected early trials in 
which the cats learned to find their way out of the box through the glass 
door. There was, to be sure, the supplementary learning to use the pole 
to open the door. A harder but possibly clearer problem, like the string- 
pulling problems studied by Adams, perhaps would be solved by fewer 
animals, but those which solved them could use the solutions in novel 
situations. Thus the chosen learning situation has a great deal to do with 
what aspects of behavior will be revealed. The problem-box of Guthrie 
and Horton, which at first blush appears to lay bare the primitive nature 
of learning, may in fact be a highly specialized situation poorly designed 
to show the behavior of the cat as it would solve a problem which, while 
harder for it, was more appropriate to it in clarity of cue-response rela- 
tionships. 

85 Guthrie (1952), pages 271-272. 
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STEPS TOWARD FORMALIZATION AND ADDITIONAL 
EXPERIMENTAL TESTING 

Since the first edition of this book appeared, some of Guthrie’s 
students have undertaken the task of clarifying and formalizing his 
theory, and of designing new experiments crucially related to it. The 
theory has gained support also from other quarters, and has now become 
a serious contender among contemporary learning theories, as it was there 
suggested that, with such help, it might become. 

Guthrie’s statements have been collected and cast into postulational 
form by Voeks (1950). She goes on to state eight theorems open to experi- 
mental test, and has herself provided some confirming experimental re- 
sults. Her four basic postulates are here quoted verbatim: 

Postulate 1: Principle of Association 

(a) Any stimulus-pattern which once accompanies a response, and/or im- 
mediately precedes it (by y 2 seconds or less), becomes a full-strength direct 
cue for that response. ( b ) This is the only way in which stimulus-patterns not 
now cues for a particular response can become direct cues for that response. 

Postulate 2: Principle of Postremity 

(a) A stimulus which has accompanied or immediately preceded two or 
more incompatible responses is a conditioned stimulus for only the last response 
made while that stimulus was present, (b) This is the only way in which a 
stimulus now a cue for a particular response can cease being a cue for that 
response. 

Postulate 3: Principle of Response Probability 36 

The probability of any particular response's occurring (P) at some specified 
time is an increasing monotonic function ( x ) of the proportion (N) of the stimuli 
present which are at that time cues for that response. (P = N*) . 

Postulate 4: Principle of Dynamic Situations 

The stimulus-pattern of a situation is not static but from time to time is 
modified, due to such changes as result from the subjects making a response, ac- 
cumulation of visceral changes and other internal processes of 

the subject, i !•<;:: ■■!<■■ n v ■!» v Y, or uncontrolled variations in the stimuli 
present 

These four postulates (and the theorems related to them) deal with 
isolated responses as the essential core of Guthrie’s theory. Unfortunately, 
Voeks’s theorem system has not yet been extended to deal systematically 
with the puzzling problems posed by various arrangements of reward and 

36 Some of the subscripts have been omitted from the symbols in Postulate 3 as un- 
necessary here. 
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punishment, with anticipatory responses, with the integration of acts 
through movement-produced stimuli. Nevertheless, the start made by 
Voeks is a very useful one, and her experimental tests are cogent. 

Her first experimental test 37 (completed before the system was 
formalized, but consistent with the formalization) deals with the pre- 
diction of behavior in a maze. She studied the individual responses at 
each choice-point of 57 human subjects learning a raised relief finger 
maze and a punch-board maze. She tested which of two predictions was 
the more accurate: (1) prediction based on the frequency of prior 
choices at the point of choice, and (2) prediction based on postremity 
(i.e., the last choice that was made). Postremity won out easily. The 
success of prediction by postremity was most striking when the prediction 
disagreed with that on the basis of frequency. For 56 of the 57 subjects, 
postremity predicted better than frequency for those choices in which 
the predictions disagreed. The difference was statistically highly signifi- 
cant. These results are, of course, in agreement with Guthrie’s theory, 
especially as stated in Voeks’s Postulate 2. 

The major puzzle is why the maze was learned at all. Why was not 
the last trial chiefly a repetition of the first, with some minor fluctuations 
according to Postulate 4? The explanation is not very difficult for the 
high-relief maze, for the last response was always leaving the choice- 
point by the true path. The explanation is more difficult for the punch- 
board maze, because a "non-correction” method was used. That is, if 
the "wrong” hole was punched, this was signalled to the subject, but he 
went on to the next choice without punching the "right” hole. Let us 
look at Voeks’s explanation of the learning of the punch-board maze: 

. . . Now, in the learning of the punch-board maze, S may insert his 
stylus into the wrong hole of a pair, e.g., pair 8. A stimulus then is presented 
(the sound of the buzzer in our experiment, E saying "wrong” in others, etc.) 
which may cause S to withdraw his stylus and make additional responses on that 
trial to that choice point. These responses may involve, for instance, the S’s say- 
ing to himself "Eight, not this hole, that one is right,” or even making incipient 
or possibly overt movements toward the other correct hole while looking at the 
pair. If this has been the case, the next time S comes to the pair of holes, he will 
say "Eight” (as instructed), and it is expected he will start toward the incorrect 
hole, draw his stylus back, say "No, not this hole; that one is right,” and then 
make the previously established conditioned response to the stimulus "That-is- 
right,” i.e., the response of inserting his stylus into the hole at which he has just 
looked and said "That is right.” Thus another error may be eliminated. This 
again is in accordance with the principle of postremity. 38 

37 Voeks (1948). 

Voeks (1948), page 505. 
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In fairness to the experimenter it must be pointed out that this 
change of response would have been recorded as a failure of prediction 
by postremity, although on the basis of her theoretical analysis it was 
no failure at all. Hence her successful predictions by postremity are 
on the conservative side. The explanation offered for the actual learn- 
ing (the elimination of an error) seems to come very near to an ex- 
planation according to knowledge of results or effect, and gives some- 
what specialized meaning to the concept of “doing what you last 
did." 

The second experiment 39 studies conditioned eyelid responses of 
human subjects, measuring both the occurrence of the responses and 
their amplitudes. The question asked is this: “Is a stimulus-response 
connection gradually strengthened by reinforcement, or is a stimulus- 
response connection established suddenly, in all-or-none fashion?” Hull 
is said to favor gradual strengthening, Guthrie all-or-none appearance of 
responses, according to Voeks’s postulates 1 and 2. 

Two tests were made of the theory of gradual strengthening, the 
first studying amplitude, the second frequency, Beginning with the first 
conditioned response (CR), succeeding responses for each subject were 
divided into fourths. For only 6 of 32 subjects was there a progressive in- 
crease in amplitude of CR from quarter to quarter. In general, increases 
from quarter to quarter were not statistically significant, although for 
25 of the 32 subjects the last CR was larger than the first (a significant 
difference). Amplitude changes thus gave little support to Hull’s theory 
of gradual increase in strength with repeated reinforcement. The fre- 
quency results were more strikingly in Guthrie’s favor, because a response 
was acquired in all-or-none fashion. No subject showed an increase in 
frequency of CR from quarter to quarter following his first CR. Half 
the subjects gave CR’s on every trial after their first one, and there were 
only a few lapses for the others. 

How do Voeks’s results square with the many acquisition curves 
published for CR’s? It is well known that curves for groups of subjects 
show characteristics very different from curves for individual subjects. 
Voeks has very convincingly shown that response acquisition may be 
all-or-none, and yet a group learning curve will show a gradual slope. 
The accompanying learning curve (Figure 7) results, for example, when 
the probability of response is plotted as a group function for 15 sub- 
jects all of whom had jumpwise curves, that is, all of whom responded 
consistently with CR’s after the first CR was made. The form of the curve 
39 Voeks (1954). 
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is determined solely by the trial on which the first CR happened to ap- 
pear for different subjects. 40 

Voeks predicted jumpwise curves of learning under the very uni- 
form conditions of stimulation that she arranged to produce, and her 
predictions were well borne out. A jumpwise curve is, of course, ideal for 
a test of the postremity principle: if there is an unbroken run of no 
responses, the prediction by postremity is perfect during the course of 
this run; if this is followed by an unbroken run of conditioned responses, 



FIGURE 7. Misleading effect of combining individual learning curves. The group 
curve results from combining the curves of 15 subjects all of whom had jumpwise 
curves, i.e., a run of no responses followed by consistent responding. After Voeks 
(1954), page 145. 

prediction by postremity is perfect then, too. The prediction fails for only 
one trial (that on which the conditioning occurs!). By the test of pos- 
tremity prediction, Guthrie’s theory was beautifully confirmed. Of the 
total predictions made, 84.6 per cent were correct. 

What are we to do with the fact that an unconditioned response of 
the eyelid did occur (within y 2 se.cond of the conditioned stimulus) on 
every trial from the beginning? Does not Voeks’s Postulate 1 lead us to 
expect a conditioned response right away? Why does the CR wait for 
several trials (as many as 10) before it makes its appearance? Perhaps 
there is something to the group curve after all, showing that the proba- 
bility of giving the first CR increases with the number of reinforcements. 

40 Not all individual conditioning curves are of Voeks’s jumpwise type. Many acquisi- 
tion curves showing gradual acquisition have been published for single learners: dogs, 
rats, human subjects. For individual curves of eyelid conditioning (the response Voeks 
studied) see Hilgard (1931^, page 14, Cohen, Hilgard, and Wendt ( 1933), page 64. 
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The postremity principle cannot predict when the first CR appears, be- 
cause its appearance always violates the principle (at least, so far as 
recorded responses are concerned). 

This leads us to consider another postulate. There is a frequency 
principle in Guthrie’s system, stated in Voeks’s Postulate 3. It is the 
principle that cues are assimilated to a response with practice, and, con- 
versely, cues are alienated during forgetting. According to Guthrie: 

The strengthening of an S-R connection with repetition may very possibly 
be the result of the enlistment of increasing numbers of stimuli as conditioners. 41 

But why should practice make the effect increasingly certain? Is it not quite 
possible that on successive practice periods more and more conditioners are 
enlisted, so that after twenty periods there is a high probability that the cue will 
have enough support to be effective? 42 

Although this interpretation is very plausible, it makes it difficult 
to find a crucial test of Guthries theory. If one-trial learning suffices, then 
either the basic principle of association, or the related principle of 
postremity, is confirmed; if learning is gradual, then the principle of 
probability is confirmed. Only as the principles are made more precise 
will it be possible to find exceptions and hence improve the theory. 
Estes (1950) has given a more precise statement of Guthrie’s probability 
principle and worked it out in a form suitable for critical testing. He 
regards a stimulus, or stimulating situation, as a finite population of rela- 
tively small, independent, environmental events, of which only a sample 
is effective at any given time. Each occurrence produces conditioning of 
the momentary effective sample of stimulus elements. Estes acknowl- 
edges that the line of argument is the one developed by Guthrie. 43 We 
shall return to the details of Estes’ theory later, as we consider other 
mathematical and probabilistic theories. 

The work of Voeks and of Estes well illustrates the steps needed to 
make Guthrie’s theory more suitable for experimental testing. While 
the theory is itself undergoing formalization, a number of aspects of it 
are undergoing experimental tests. It is difficult to summarize these 
briefly, because most of the experiments require, in addition to the pre- 
sentation of their data, the elaboration of an argument to show how the 
data lead to a preference for one interpretation over another. A few rep- 
resentative studies are listed in Table 1 merely to indicate the kind of 
work that is going on, and to provide a “search-pattern” for those who 
wish to seek out the newer studies that are appearing right along. 

41 Guthrie { 1930), page 420. 

42 Guthrie ( 1935), page 100. 

48 Estes ( 1950), page 100. 
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Enough of the current studies favor Guthrie’s theory to insure that 
it will be given serious attention as an active contemporary theory of 
learning. 


ESTIMATE OF GUTHRIE’S POSITION 

Guthrie’s Position on Typical Problems of Learning 

By way of comparative summary, how Guthrie stands on the several 
representative problems of learning will be briefly stated. 

1. Capacity. Problems of capacity are not formally treated, although 
species differences are recognized and allowance is made for maturation 
as a determiner of many classes of acts. 44 Presumably any response which 
the organism can make can become associated with any stimulus to which 
he is sensitive — a generalization about the possibility of learning which is 
reminiscent of what Thorndike says about associative shifting. If pressed, 
Guthrie could find a basis for differences in capacity both in the differ- 
entiation of movement and in the discrimination among proprioceptive 
cues. All animals are not equally versatile and equally equipped with 
receptors. 

2. Practice. Practice assimilates and alienates cues, until a whole 
family of stimulus combinations comes to evoke a whole family of re- 
sponses which lead to the outcome socially described as successful per- 
formance. Because skill represents a population of habits, learning ap- 
pears to accumulate with repetition, although basically each individual 
habit is learned at full strength in a single repetition, 

3. Motivation , Motivation affects learning indirectly through what 
it causes the animal to do. Reward is a secondary or derivative principle, 
not a primary one as in Thorndike’s system. Reward works because it 
removes the animal from the stimulating situation in which the ‘correct” 
response has been made. It does not strengthen the “correct” response, 
but prevents its weakening because no new response can become at- 
tached to the cues which led to the correct response. Thus there is a 
relative strengthening, because responses to other cues get alienated. 

Punishment does at once several different things. In general, its ef- 
fects for learning are determined by what it causes the organism to do, 
according to the principle that the best predictor of learning is the 
response in the situation that last occurred. We may distinguish four 
cases: 

44 Guthrie (1935), pages 18, 38. 
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a. Mild punishment may be merely exciting, and enhance ongoing 
behavior without disrupting it. 

b. More intense punishment may break up a prior habit by leading 
to incompatible behavior in the presence of the cues for the earlier habit 

c. Continuing punishment acts like a drive, producing maintaining 
stimuli that keep the organism active until it finds relief. Then the conse- 
quence for learning is really that of reward: the act that leads to safety 
is rewarding because it terminates the punishment and by removing the 
maintaining stimuli protects from unlearning the activities carried on in 
the presence of those stimuli. 

d. Stimuli that have previously accompanied the punishment pro- 
duce behavior that formerly occurred following the punishment itself. 
Here we have an illustration of anticipatory response, essential to avoid- 
ance behavior. The cues to avoidance must earlier have been present 
at the time of punishment for this anticipation to occur. 

4. Understanding. Concepts like “insight” are handled in a derisive 
manner, although it is recognized that learning with foresight of its 
consequences may occur. The tendency is to talk down such learning, 
however, just as Thorndike did, and to emphasize the mechanical and 
repetitious nature of most human as well as animal learning. Such learn- 
ing with intention and foresight as does occur is explained on the basis 
of conditioned anticipatory or readiness reactions, based upon past ex- 
perience and hence not contradicting association principles. 

5. Transfer. Learning transfers to new situations because of com- 
mon elements within the old and new. In this the position is rather like 
Thorndike's. Stress is laid, however, on the identity being carried by way 
of common responses evoked, the proprioceptive stimuli being sufficiently 
similar from responses to a variety of stimuli to evoke common condi- 
tioned responses. The emphasis upon movement-produced stimuli thus 
represents Guthrie's supplementation to Thorndike. 

Because of his principle of responses being conditioned to all ad- 
ventitious contiguous stimuli, Guthrie expects rather little transfer and 
is, in fact, rather extreme about it. The only way to be sure to get desired 
behavior in a new situation is to practice in that new situation as well. 
To be able to perform in a variety of situations, you have to practice in 
a variety of situations. 

6. Forgetting. Learning is said to be permanent unless interfered 
with by new learning. Hence all forgetting is due to the learning of new 
responses which replace the old responses. It may take place gradually 
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for the same reason that skills may be acquired gradually: remembering 
depends upon many habits to many cues, and subhabits may drop out 
gradually as subcues become attached to new responses. 

By contrast with Thorndike, Guthrie is an avowed behaviorist who 
makes it a matter of some importance to get rid of subjective terms, to 
refer, for example, to inner speech instead of to thinking. The emphasis 
upon movement-produced stimuli is part of this older behaviorist tradi- 
tion which Guthrie carries into the present. While an orthodox behavior- 
ist in these respects, his is an informal behaviorism, with little of the 
brittleness of earlier Watsonianism. 

Invulnerability of the Theory a Cause for Skepticism 

The uncertainty that exists in practically all learning experimenta- 
tion makes the fact-minded psychologist suspicious of a finished system 
at this stage of our knowledge. While scientific truth must eventually 
have exceptionless validity — if its laws are truly lawful — the history of 
our most advanced sciences shows that their theories move by successive 
approximations, and the most advanced theories do not emerge full blown 
from the head of the theorist. Even as loose a system as Thorndikes went 
through revisions on the basis of evidence regarding the effects of pun- 
ishment; Hulls system, the most carefully worked out of any, was con- 
tinuously being revamped to meet experimental fact. One of the sources 
of uneasiness about Guthrie’s system lies in its assured answers to the 
problems of learning — answers that have remained unchanged through a 
quarter of a century of the most active psychological experimentation 
we have ever known. Experimental controversies finally get resolved as 
we learn more about the independent variables that modify the measured 
consequences. No matter how these issues get resolved, Guthrie’s system 
remains unchanged. Either the theory is a miraculously inspired one or 
it is not stated very precisely, and hence is not very sensitive to experi- 
mental data. 

Criticism of Guthrie’s position is rendered somewhat difficult by the 
nature of the task which he sets himself. It is not quite clear whether 
he believes his system to have any responsibility with respect to details. 

The principle of association or conditioning is not an explanation of any 
instance of behavior. It is merely a tool by which explanation is furthered. A tool 
is not true or false; it is useful or useless. 45 

The paradox of the theory lies in the kind of sophistication implied 
in this statement, combined with a casualness which baffles the critic. It 
45 Guthrie (1935), page 232. 
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is not unfair to ask of the position that it substantiate its claims, which 
are, indeed, far-reaching and are competitive with the claims of others. 

Of the opposing points of view to which he objects, Guthrie gener- 
ously admits that their phenomena and the terms used to describe them 
are correct and useful for certain purposes. 46 What he says in essence is 
that what they do is all right for limited purposes, but it is not very help- 
ful so far as understanding ordinary learning is concerned. All other 
writers are said to be concerned with outcomes, in the form of success or 
goal achievement Under their theories, he says, the teacher must be a 
mere passive element in the situation and cannot be told how to influence 
the outcome. 47 This non sequitnr indicates a failure to take opposing 
claims seriously. Insight, if not predicted on the basis of past experience, 
must be in the category of luck, and hence lie outside of science. 48 While 
many of Guthrie’s observations are astute, a cavalier handling of serious 
alternatives to his own gives an impression of immutability inappropriate 
in a growing science. 

The Simplicity of the Theory May Be Illusory 

Certainly much of the fascination of Guthrie’s theory rests upon his 
ability to deduce a wide range of phenomena from the single principle of 
one-trial contiguous association. Parsimonious scientific theories are at- 
tractive, and this is the ultimate in reductionist theories. Although Voeks 
found it convenient to state four postulates, with proper definitions of 
the stimuli and responses entering into association, she would have found 
one to be enough, with the other three as corollaries. 

A painstaking search through Guthrie’s writings for careful defini- 
tions of stimulus and response, for distinctions between observables and 
constructs, for statements taking the form of predictions and those tak- 
ing the form of a posteriori explanations, has led recent critics to con- 
clude: 

While the principles of conditioning which he expands seem to have a 
parsimony that would be desirable in a theoretical formulation of behavior, a 
closer analysis reveals that a formidable set of additional assumptions and con- 
structs are required if his theory is to possess any real applicability to experi- 
mental data. 49 

46 See Guthrie’s comments on Tolman, Hull, Skinner and others, Guthrie (1952), 
pages 189-252. These pages are new to the revised edition of the book, although the 
orientation is unchanged from views expressed much earlier, e.g., Guthrie (1942), 
page 57. 

47 Guthrie ( 1942), page 57. 

48 Guthrie ( 1935), page 25. 

48 Mueller and Schoenfeld (1954), page 377. 
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It is undoubtedly true that many reviews of Guthrie in the literature have 
mistaken incompleteness for simplicity. 50 

Guthrie’s Contributions 

Despite his formal concern with the more hard-boiled aspects of 
scientific observation and scientific logic, Guthrie is without a peer in 
the use of anecdote and illustration to make pertinent comments about 
the activities of everyday life, including the symptoms found in the 
psychological clinic. This complicated material is talked about in dramati- 
cally simple terms, and his theory makes this kind of talk possible. The 
difference between plausibility and proof is one of the differences that 
hounds psychological science, and a theory that is not sensitive to experi- 
mental data will not be sensitive either to those clinical patients who do 
not get well following good advice. Nevertheless, there is something to 
be learned from Guthries type of psychologizing, and it is not a sufficient 
explanation to remark that he is a wise and experienced person, a shrewd 
observer of human beings. His kind of psychologizing is appealing enough 
to have led many promising young men and women to enter upon careers 
in psychology — a contribution to the field not to be overlooked. 

At the experimental level, Guthrie’s greatest contribution is to call 
attention to the large element of repetitiveness and stereotypy in behavior 
when the opportunities' are favorable to such monotonous behavior. The 
tendency toward smugness is easily overlooked, especially by those who 
choose to study the changes in behavior called learning. 

There are enough well-trained younger psychologists friendly to 
Guthrie’s ideas to undertake the task of increasing the rigor of his sys- 
tem beyond that yet attained. The promise of the system is great enough 
to make the effort worth while. 
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Chapter 4 

SKINNER’S OPERANT 
CONDITIONING 


I N a series of papers beginning in 1930, B. F. Skinner proposed a formu- 
lation of behavior which arose out of observations of animal perform- 
ance in a type of experiment that he invented: the bar-pressing activity 
of a rat in a specially designed box called (by others) the Skinner box. 
The experiments and theories were first brought together in book form in 
his Behavior of organisms (1938).JExperimentation has continued, much 
of it using a new organism (the" pigeon) and a new equivalent of bar- 
pressing (the pigeon’s pecking at a spot). The principles have become 
the basis for two textbooks of geperal psychology, Keller and Schoen- 
feld’s Principles of psychology ( 1950 ) and Skinner s Science and human 
behavior (1953). As an** avowed behaviorism making use of condition- 
ing principles, the system can be understood as a development of the 
ground broken by Thorndike, just as Guthries system can be so under- 
stood. But if Guthrie’s theory is thought of as a development of Thorn- 
dike’s associative shifting, ^Skinner’s operant conditioning is a development 
more along the lines of Thorndike’s learning by selecting and connecting 
under the law of effect* As we shall see, Skinner acknowledges two kinds 
of learning, just as Thorndike did, but, like Thorndike, he places more 
emphasis upon that kind of learning which is under the control of its 

consequences. ^ 

} 

, *• 

/respondent and operant behavior 

The greatest break with conventional stimulus-response psychology 
within Skinner s system is the distinction between respondent and oper- 
ant behavior. Conventional stimulus-response psychology enforced the 
dictum “no stimulus no response” by assuming the presence of stimuli 
when a response occurred though no stimuli were identifiable.|AIthough 

82 
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it was often convenient to talk about “random” or “spontaneous” re- 
sponses, it was not doubted that stimuli were present to elicit them, if 
the experimenter only had means of detecting them. Skinner finds this 
method of forcing facts both undesirable and unnecessary. He proposes 
that two classes of response be distinguished, a class of elicited responses 
and a class of emitted responses. 

Respondent and Operant Distinguished 

Responses which are elicited by known stimuli are classified as 
respondents. Pupillary constriction to light and the knee jerk to a blow 
on the patellar tendon serve as convenient illustrations. There is a sec- 
ond class of responses which need not be correlated with any known 
stimuli. These emitted responses are designated operants , to distinguish 
them from respondents. While the conventional treatment of such re- 
sponses is to consider them as respondents with unknown stimuli, Skinner 
holds the conviction that the stimulus conditions, if any, are irrelevant 
to the understanding of operant behavior. Because operant behavior is 
not elicited by recognized stimuli, its strength cannot be measured ac- 
cording to the usual laws of the reflex, which are all stated as functions 
of stimuli. Instead, rate of response is used as a measure of strength. 

An operant may, and usually does, acquire a relation to prior stimu- 
lation. In that case it becomes a discriminated l operant; the stimulus be- 
comes an occasion for the operant behavior, but- is not an eliciting stimu- 
lus as in the case of a true reflex. Skinner formerly extended the term 
“reflex” to cover the operant, although this proved somewhat awkward 
because several of the laws of the reflex do not apply/A simple illustration 
of an operant co-ordinated with a stimulus would be a reaction-time 
experiment as commonly conducted in the psychological laboratory. The 
correlation between stimulus and response may easily be changed, as by 
instructions to depress the key instead of lifting the finger from iy The 
relationship of latency of response to changes in stimulus intensity is 
very different for a discriminated operant from what it is for an elicited 
response, say thedid reflex to sound — a respondent. Such a comparison 
between a discriminated operant and a respondent has been made by 
Peak (1933). Although the characteristics of voluntary and reflex eyelid 
responses both were changed in response to sounds differing in loudness, 
the magnitudes of changes differed for the two kinds of response. 

: Most human behavior is operant in character. The behavior of eat- 
ing k meal, driving a car, writing a letter, shows but little of respondent 
character. The emphasis which Skinner places upon operant behavior is 
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appropriate if he is going to have something to say that applies in prin- 
ciple to the commonest forms of human (and animal) behavior. 

Two Types of Conditioning 

Related to the two types of response there are said to be two types 
of conditioning. 

The conditioning of respondent behavior is assigned to Type S, be- 
cause reinforcement is correlated with stimuli. The conditioned stimu- 
lus (e.g., a tone) is presented together with the unconditioned stimulus 
(e.g., food) and thus comes to elicit the response (e.g., salivation). The 
reinforcing event that interests Skinner is the presentation of the un- 
conditioned stimulus, not the response to it Pavlov's classical condi- 
tioning experiment is said to be of Type S. Its two laws are the Law of 
Conditioning of Type S, and the Law of Extinction of Type S. 1 The 
law of conditioning makes such conditioning depend upon the approxi- 
mate simultaneity of stimuli. The evidence for the existence of Type S 
conditioning is actually quite slim, because much that by experimental 
arrangement conforms to classical conditioning is better understood as 
a consequence of factors other than contiguity of stimulus and response. 
Hull, as we shall see, denied the existence of conditioning of Type S; 
Skinner admits that it does not appear experimentally in pure form. 2 
But Skinner does not attribute much importance to Type S in any case. 
Whether or not there is such conditioning does not matter too much to 
his system. 

Type R he believes to be much the more important. This is the con- 
ditioning of operant behavior, and the letter R is used to call attention 
to the important term in the correlation with reinforcement. In this case 
Jtis a r esponse which is correlated with reinforcement. T he experimental 
example which he originally used was lever-pressing. This response may 
be strengthened by following it with food. It is not the sight of the lever 
which is important; it is the pressing of the lever. The conditioned re- 
sponse does not resemble the response to the reinforcing stimulus; its 
relationship to the reinforcing stimulus is that it causes it to appear. In 
operant conditioning, conditioning of Type R, reinforcement cannot fol- 
low unless the conditioned response appears; reinforcement is contingent 
upon response. The two laws of Type R are not unlike those of Type S, 
including a law of conditioning and a law of extinction. The law of con- 
ditioning of Type R may be compared to Thorndike s law of effect: If 

1 Skinner ( 1938), pages 18-19. 

2 Skinner ( 1938), page 238. 
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the occurrence of an operant is followed by presentation of a reinforcing 
stimulus , the strength is increased , 3 Note that the reinforcing situation is 
defined by its stimulus; nothing is said about satisfying after-effects or 
about drive reduction. What gets strengthened is not a stimulus-response 
connection, because the operant requires no stimulus; this statement is 
unlike Thorndike: 

This dependence upon the posterior reinforcing stimulus gives the term 
operant its significance. . . . The operant . . . becomes significant for be- 
havior and takes on an identifiable form when it acts upon the environment in 
such a way that a reinforcing stimulus is produced. 4 

/ The mechanical arrangement under which Type R conditioning is 
usually demonstrated is that suggested by the quotation, a situation in 
which the response of the organism produces the reinforcing agent. This 
is what has been called instrumental conditioning to distinguish it from 
the arrangements of classical conditioning. 5 

One suggestion offered quite tentatively by Skinner 6 is that condi- 
tioning of Type S may be limited to autonomic responses, Type R to 
skeletal behavior. The crucial question is whether Type S occurs at all; 
if it does, it may well be limited to autonomic responses. That Type R 
occurs is evident, and most skeletal responses, including those obtained 
under the arrangements of classical conditioning, can easily be shown 
to conform to the pattern of discriminated operants. Actual experiments 
in which autonomic conditioning takes place (salivation, galvanic re- 
sponse) are full of indirect accompaniments of Type R. When the cir- 
cumstances seem almost ideal for demonstrating Type S conditioning, as 
in attempts to condition pupillary constriction by presenting a tone 
along with a light, it is extremely difficult to obtain any conditioning at 



REINFORCEMENT OF AN OPERANT 

In order to get at the quantitative relationships within operant con- 
ditioning, Skinner designed a special apparatus suitable for use with 

3 Skinner (1938), page 21. 

4 Skinner (1938), page 22. 

5 Hilgard and Marquis (1940), pages 51-74. For Skinner's laboratory methods, see 
Ferster (1953). 

6 Skinner (1938), page 112. 

7 Recent attempts at pupillary conditioning have been most unsuccessful, e.g., 
Wedell, Taylor, and Skolnick (1940), Hilgard, Dutton, and Heknick (1949), Young 
(1954). Some success has been reported, however, by Kotake and Mihama (1951) 
and Mihama and Kotake (1954), so the matter is still open. “Pure cases" of Type S 
conditioning are hard to find. The heart-rate conditioning of Notterman, Schoenfeld, 
and Bersh ( 1952) includes the possibility of operant intermediaries. 
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white, rats. It consists essentially of a darkened sound-resisting box in 
which the rat is placed. There is a small brass lever within the compart- 
ment which, if pressed, delivers a pellet of food. The lever is connected 
with a recording system which produces a graphical tracing of the num- 
ber of lever pressings plotted against the length of time that the rat is 
in the box. By a carefully controlled handling of the animals, remarkably 
consistent and "lawful” results can be obtained. Modifications of the ex- 
periment can be introduced so that food is not delivered every time the 
lever is depressed. The consequences of doing this and of making other 
changes in the situation have been systematically reported. 

The evident consequence of the reinforcement of an operant is to 
^increase the rate with which the operant response is emitted. This in- 
crease in rate of responding is an indicator of the increased probability 
of response, which is the appropriate measure of operant strength , sub- 
stituting for such measures as amplitude and latency, more appropriate 
for measuring the strength of a respondent. While the expression reflex 
strength was originally used for both respondent and operant behavior, 8 
in Skinner's later writings he tends to restrict the word "reflex” to its 
ordinary meaning of elicited reflexes (salivation, knee-jerk), and to con- 
ditioned reflexes based on them, that is, to respondent behavior. 9 ) 

Extinction as a Measure of Operant Strength 

When an operant is regularly reinforced, the rate of responding is 
^Interrupted by the activity of eating. Because the time-consuming act of 
eating does not occur during extinction, responses within extinction serve 
better than responses during conditioning as measures of the conse- 
quences of reinforcement. Two measures of responses within extinction 
are commonly used: rate of responding, and the total number of re- 
sponses before responding returns to its normal rate prior to conditioning. 
The total number of responses during extinction now often described as 
resistance to ’'extinction was formerly called the reflex reserve, 10 a figure 
of speech to describe a kind of reservoir of responses ready t6 be emitted 
during extinction. Skinner no longer believes the concept of a reflex re- 
serve to be very useful, although he appears to reject it because of his 
later interpretation of appropriate scientific concepts rather than because 
of any change in the factual relationships described. 11 

8 Skinner ( 1938 ) , page 20. 

9 Skinner (1938), pages 45-59. 

10 Skinner ( 1938), page 26. 

11 Skinner (1950), page 203. The original concept was not without its critics, e.g., 

Eflson (1939). 6 
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A single reinforcement suffices to produce a number of conditioned 
operant responses in extinction. After receiving a single pellet of food 
following one pressing of the bar, a rat may respond 50 or more times, 
yielding a typical extinction curve. Additional reinforcements add slowly 
to the number of responses during extinction* An extinction curve follow- 
ing a single reinforcement and one following 250 reinforcements are re- 
produced in Figure 8. The curves are cumulative ones, not to be con- 
fused with learning curves as usually plotted. As the curve levels off 
it means that responses have stopped; as the curve is constructed it is not 
possible for it to fall. 



FIGURE 8. Extinction of lever-pressing by rats following a single reinforcement 
and following 250 reinforcements. Although several times as many responses are 
emitted after 250 reinforcements as after a single reinforcement, it is evident that the 
total number of responses in the “reserve” does not increase in direct proportion to 
the number of reinforcements. Data from Skinner (1933) and F. S. Keller and 
A. Kerr (unpublished data), replotted from Skinner (1938), pages 87 and 91. Both 
curves have been moved back so that time is here shown as measured from the first 
response. 

(The considerable influence of a single reinforcement leads Skinner 
to conjecture that, if we were able to isolate a single operant, we might 
find an instantaneous change to a maximum probability /V This con- 
jecture, we may note, is very similar to that of Guthrie with respect to 
one-trial learning. The further conjecture that actual conditioning is 

12 Skinner (1953a), page 68. * 
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found to be gradual because several component operants are involved is 
also not unlike Guthrie. The residual difference is that the single event 
for Skinner is a reinforcement, whereas for Guthrie it is a mere occur- 
rence. 

/The importance of single reinforcements ( “one-trial learning”) is 
brought out in two arrangements which have come to prominence in 
Skinner’s later work. The first of these is the development of “supersti- 
tious” behavior. The second is the control of behavior through successive 
approximations.^ 

If there is only an accidental connection between the response and the ap- 
pearance of a reinforcer, the behavior is called “superstitious.” We may demon- 
strate this in the pigeon by accumulating the effect of several accidental contin- 
gencies. Suppose we give a pigeon a small amount of food every fifteen seconds 
regardless of what it is doing. When the food is first given, the pigeon will be 
behaving in some way — if only standing still — and conditioning will take place. 
It is then more probable that the same behavior will be in progress when the 
food is given again. If this proves to be the case, the “operant” will be further 
strengthened. If not, some other behavior will be strengthened. Eventually a 
given bit of behavior reaches a frequency at which it is often reinforced. It then 
becomes a permanent part of the repertoire of the bird, even though the food 
has been given by a clock which is unrelated to the bird’s behavior. Conspicuous 
responses which have been established in this way include turning sharply to 
one side, hopping from one foot to the other and back, bowing and scraping, 
turning around, strutting, and raising the head. The topography of the behavior 
may continue to drift with further reinforcements, since slight modifications in 
the form of response may coincide with the receipt of food. 13 

If, for example, three reinforcements were always required in order to 
change the probability of a response, superstitious behavior would be unlikely. 
It is only because organisms have reached the point at which a single contingency 
makes a substantial change that they are vulnerable to coincidences. 14 

Because the organism tends to do what it was doing at the time of 
the reinforcement, it is possible to lead an organism to do very much 
what the experimenter wishes by “baiting” each step of the way. Skinner 
and his collaborators have demonstrated great skill and ingenuity in 
animal training through the use of this method. )Some of the reports have 
appeared in the popular picture magazines, an early account showing a 
rat using a marble to obtain food from a vending machine, another show- 
ing pigeons playing a modified game of table tennis, a third in which a 
dog learns to retrieve a bone from a pedal-operated refuse can. The 
animal-training possibilities are almost unlimited. 15 How do single rein- 

SkinneFt 1953a ) , page 85. 

14 Skinner ( 1953a ) , page 871 

is The accounts have appeared many places, such as: Life , May 31, 1937; Science 
Newsletter , June 17, 1950; Newsweek , May 7, 1951. See also Breland and Breland 
(1951), Skinner (1951). 
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forcements bring such diverse behavior under control? Here is one demon- 
stration: 

An effective classroom demonstration of the Law of Effect may be arranged 
in the following way. A pigeon, reduced to 80 per cent of its ad lib weight, is 
habituated to a small, semi-circular amphitheatre and is fed there for several days 
from a food hopper, which the experimenter presents by closing a hand switch. 
The demonstration consists of establishing a selected response by suitable rein- 
forcement with food. For example, by sighting across the amphitheatre at a scale 
on the opposite wall, it is possible to present the hopper whenever the top of 
the pigeon’s head rises above a given mark. Higher and higher marks are chosen 
until, within a few minutes, the pigeon is walking about the cage with its head 
held as high as rouble Tn another demonstration the bird is conditioned to 
strike a marble pkuvc! 0:1 ilio floor of the amphitheatre. This may be done in a 
few minutes by reinforcing successive steps. Food is presented first when the 
bird is merely moving near the marble, later when it looks down in the direction 
of the marble, later still when it moves its head toward the marble, and finally 
when it pecks it. Anyone who has seen such a demonstration knows that the Law 
of Effect is no theory. It simply specifies a procedure for altering the probability 
of a chosen response. 16 

: Positive and Negative Primary Reinforcers 

A reinforcer is d efined by its effects^ Any stimulus is a r^irxforcer if 
it increases the probability of a response. The stimuli that happen to act 
as reipforcers fall into two classes: 17 

^1. A positive reinforcer is a stimulus which, when added to a situa- 
tion, strengthens the probability of an operant response. Food, water, 
sexual contact, classify as positive reinforcers. 

A negative reinforcer is a stimulus which, when removed from a 
situation, strengthens the probability of an operant response. A loud 
noise, a very bright light, extreme heat or cold, electric shock, classify as 
negative reinforcers. 

Notice that the effect of reinforcement is always to increase the 
probability of response. Punishment is something other than negative 
reinforcement as here defined. While a reinforcer is defined by its effects, 
this is not true for punishment. 'Punishment, according to Skinner and 
his followers, is defined as an experimental arrangement whose effects 
remain to be investigated empirically. The arrangement is the opposite 
of reinforcement (although the effects are not opposite), so that two 
main cases arise: (1) the presentation of a negative reinforcer, and (2) 
the removal of a positive reinforce^ We shall return later to a considera- 
tion of the consequences of punishment. 

/“~Why is a reinforcer reinforcing? One possibility is that reinforcement 


16 Skinner (1950), page 200. 

17 Skinner ( 1953a), page 73. 
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reduces a state of deprivation, that is, it meets a need, as when a hungry 
animal eats. The hungrier a bird, the greater the rate of its responses as 
a consequence of food reinforcement. Skinner recognizes, however, that 
reinforcement does not always reduce deprivation. Conditioning may oc- 
cur before any substantial change can take place in the internal state 
of the learner. 

All we know is that the type of event which reduces deprivation is 
also reinforcing. 

Skinner rather tentatively accepts an explanation in terms of evolu- 
tionary biology, but he does not find it of much help in the detailed f unc- - 
tional analysis of what actually occurs. 18 

r 

I Schedules of Reinforcement 

The reinforcement of operant behavior in ordinary life is not regular 
and uniform. The fisherman does not hook a fish with every cast of the 
line; the crop the farmer sows does not always yield a harvest. Hence 
the problem of maintaining or strengthening a response through inter- 
mittent reinforcement is more than a laboratory curiosity. 19 Skinner has 
explored extensively two main classes of intermittent reinforcement, now 
called interval reinforcement an k4atio reinforcement ... 

By interval reinforcement he means reinforcement given at fixed 
intervals of time, every 3 minutes or every 10 minutes, at the discretion 
of the experimenter. In the experiment itself this means the reinforce- 
ment of the first response that occurs after that interval of time, so there 
is actually some slight variation im s the time of reinforcement depending 
upon the activity of the learner/ Tnis arrangement, also called periodic 
reconditioning, or periodic reinforcement, delivers a standard amount of 
reinforcement per hour. It results in lawful rates of responding, the rate 
being proportional to the interval between reinforcements, the shorter 
intervals yielding more rapid response rates. Under standard conditions 
of experimentation and drive, for example, Skinner found about 18 to 20 
responses per reinforcement, over a considerable range of intervals| The 
uniformity of rates of responding is illustrated in Figures 9 and 10. 

[ The uniform number of responses per reinforcement is called the 
extinction ratio , that is, the ratio of unreinforced to reinforced responses. 
The size of the ratio does jjpt change much from one length of interval 
to another, provided^fkiVe remains constant. 

18 Skinner ( 1953a), pages 81-84. 

19 The expression "intermittent reinforcement” is a better descriptive term than 
“partial reinforcement,” an earlier name for the same arrangement (Hilgard and 
Marquis, 1940, page 347). 
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Although curves drawn in the scale o£ Figures 9 and 10 appear uni- 
form, there is actually a stepwise character to response when reinforce- 
ment is given at regular intervals. This follows because a response is never 
reinforced shortly after a prior reinforcement, and so, after a time, the 
Irate of responding is low following a reinforcement. This minor fluctua- 



FIGURE 9. Responses within one session of fixed-interval reinforcement. A pellet 
was delivered every 3, 6, 9, and 12 minutes, respectively. The more frequent the 
reinforcement, the more rapid the rate of responding, although each rate is relatively 
uniform. After Skinner (1938), as reproduced by Hilgard and Marquis (1940), page 
151. 

FIGURE 10. Responses within repeated sessions of fixed-interval reinforcement. 
Responses of the same rats whose records are given in Figure 9 are here accumulated 
for successive daily sessions. The uniformity of rate persists throughout. When ex- 
pressed as number of responses per reinforcement this rate is dccribcd as the "extinc- 
tion ratio.” After Skinner (1938), as reproduced by Hilgard and Marquis (1940), 
page 151. 

tion or galloping of the curves can be corrected by a modification known 
as variao le-interval reinforcement (also called aperiodic reinforcement) 
Under this arrangement, an average interval is substituted for a fixed 
interval, so that, while a response may be reinforced every 5 minutes on 
the average, in some cases the second reinforcement follows immediately 
upon an earlier reinforcement, and at other times it is longer delayed. 
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Under such a schedule, performance is remarkably stable and uniform, 
and highly resistant to extinction. An illustration is found in Figure 11. 
A pigeon has given as many as 10,000 unreinforced responses in extinction 
following such variable-interval reinforcement. 



^ one hour $\ 

FIGURE 11. Responses within variable-interval reinforcement. The curves are of 
the pecking responses of an individual pigeon reinforced at intervals ranging from 
10 seconds to 21 minutes, but averaging 5 minutes. Each of the sloping lines repre- 
sents 1,000 responses; the pen resets to zero after each 1,000. The whole record rep- 
resents some 20,000 responses in about 3 hours, with an average of 12 reinforcements 
per hour. Each reinforcement is repiesented by a horizontal dash. From Skinner 
(1950), page 208. 

The results under intermittent reinforcement show how input and 
output change under different schedules of reinforcement. While the 
constancy of the extinction ratio under conditions of interval reinforce- 
ment suggests a standard input-output ratio, something very different 
happens under another arrangement of intermittent reinforcement, that 
known as ratio reinforcement. 

In ratio reinforcement , instead of delivering a pellet of food at stand- 
ard intervals of time, a pellet is delivered after a standard number of 
responses. | In one study they were delivered after 16, 24, 32, 48, 64, 96, 
or 192 responses. These ratios have to be approached gradually, for such 
occasional reinforcements can maintain responses only if responses are 
already being emitted at a rapid rate. The end-products of such training 
are shown in Figure 12. The somewhat paradoxical finding now is that 
the less frequent the reinforcement, the more rapid the response! The 
extinction ratio has changed from the order of 20:1 under interval rein- 
forcement to 200:1 under ratio reinforcement. 

The steplike character of performance, mentioned in connection with 
reinforcement at fixed intervals, is pronounced with reinforcement at 
fixed ratios. After the burst of responding just before reinforcement there 
follows a period of very slow responding just after reinforcement. If the 
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ratio is made too high, there develop long periods of no response follow- 
ing reinforcement, a condition likened by Skinner to “abulia,” the in- 
ability to expend effort. He points out the analogy with the student who 
has finished a term paper, perhaps in a burst of speed as the deadline 
approaches, and then finds it difficult to start work on a new assignment. 2 ^ 
The pause after reinforcement may be eliminated by adopting 
variable-ratio reinforcement , that is, using a range of ratios around a mean 
value. Because the probability of reinforcement at any moment remains 
approximately constant, a uniform rate of responding ensues; because 
this probability is increased with rapid responding, the rate tends to be 
high. A pigeon may respond as rapidly as five times per second and keep 
up this rate for many hours. (This rate corresponds to that of the ticking 
of a watch. ) 



FIGURE 12. Responses within ratio reinforcement. Responses from individual rats 
reinforced every 48, 96, and 192 responses, as indicated by the horizontal lines. Un- 
der these circumstances very high rates of responding develop, the highest rate being 
found with the lowest frequency of reinforcement. After Skinner ( 1938 ) , as repro- 
duced by Hilgard and Marquis (1940), page 152. 


Schedules can also be brought under multiple control. For example, 
by selec tin g one of three colors to be projected on the key that the 
pigeon pecks, the pigeon can be put on fixed-interval, fixed-ratio, or 
variable-interval reinforcement, and will behave as it has learned to be- 
have under each of these schedules alone. Skinner has trained a pigeon 
up to nin e diff erent performances under nine different conditions of stimu- 
lation. 

f Another variation is a mixed schedule— a multiple schedule without 
external stimulus control That is, the pigeon has no differential external 
stimulus to determine whether it will be reinforced after responding 

20 Skinner (1953a), page 104. 
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30 times or after the passage, of 15 minutes. These conditions are alter- 
nated more or less at random.; The characteristic stable performance is 
thus described by Skinner: 

The bird runs off a number of responses at a high rate. If the current sched- 
ule is a ratio schedule, it will then be reinforced. If it is, however, an interval 
schedule, the bird will run somewhat past the number reinforced as a ratio and 
then will break suddenly to a zero rate of responding from which emerges a 
smooth interval |callop extending over the rest of the 15-minute period. That 
is, the bird “tests’? the schedule by running off a number of responses sufficient to 
achieve reinforcement if the schedule is a ratio; if no reinforcement is forth- 
coming, the performance “primes” the bird into the interval scallop. 21 

It is evident that the bird’s own behavior can function as a dis- 
criminative stimulus in controlling schedule performances. 

f These more recent experiments^while belonging to the same gen- 
eral family of experiments as those reported in The behavior of organisms 
(1938), show a shift of emphasis. Earlier Skinners attention was directed 
to the general course of a change in behavior over an experimental period 
or through a series of experimental sessions. His primary concern is now 
with the moment-to-moment behavior, or as he puts it, with the “fine 
grain” of the records rather than the over-all course of changes. With 
this change of emphasis the fundamental measure of response strength 
becomes the momentary probability of responding , rather than a more 
global “habit strength” or “reflex reserve.” In order to determine a mo- 
mentary probability, it is of course necessary to study frequency of re- 
sponse over more extended periods of time. Skinners emphasis upon 
frequency of response has persisted from his earliest writings, and he has 
recently ably defended his choice of this measure. 22 

Secondary Reinforcement 

The principle of secondary reinforcement is Simply this: 

A stimulus that is not originally a reinforcing one . . . can become rein- 
forcing through repeated association with one that is. 23 

That is, through conditioning, a stimulus acquires the power to condition 
Consider, for example, the acquisition of reinforcing power by the light 
in the following experiment.! When a rat presses a bar, a light comes on. 
After 1 second a pellet of food falls into the tray, reinforcing the bar- 
pressing. The light remains on for two seconds after the food appears. 
Several groups of animals are conditioned in this way, for 10, 20, 40, 80. 

21 Private communication dated December 28, 1954. 

22 Skinner (1953b). 

23 Keller and Schoenfeld ( 1950), page 232. 
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and 120 reinforcements, respectively. After conditioning, the rate of 
responding is reduced to a low level by extinction in the dark. Then 
pressing the bar again turns on the light for 1 second, but does not de- 
liver food. Under these circumstances responses again appear, showing 
that the light has acquired reinforcing properties. The number of re- 
sponses emitted in a 45-minute period increased with the number of 
prior pairings of light and food. 2 *j 

Other experiments suggest that the light will acquire secondary re- 
inforcing properties only if it appears before the reinforcing stimulus, and 
is thus either part of a chain or a discriminative stimulus. 25 

The following summary of secondary reinforcement shows its sys- 
tematic importance in operant behavior: 

1. A stimulus that occasions or accompanies a reinforcement acquires 
thereby reinforcing value of its own, and may be called a conditioned, secondary, 
or derived reinforcement. A secondary reinforcement may be extinguished when 
repeatedly applied to a response for which there is no ultimate primary reinforce- 
ment. 

2. A secondary reinforcement is positive when the reinforcement with 
which it is correlated is positive, and negative when the l ette r is negative. 

3. Once established, a secondary reinforcement is independent and non- 
specific; it will not only strengthen the same response which produced the 
original reinforcement, but it will also condition a new and unrelated response. 
Moreover, it will do so even in the presence of a different motive. 

4. Through generalization, many stimuli besides the one correlated with 
reinforcement acquire reinforcing value — positive or negative. 26 

One of the most important consequences of the development of sec- 
ondary reinforcement is the emergence of a class of generalized rein- 
forcers . 27 This generalization comes about because some secondary re- 
inforcers tend to accompany a variety of primary reinforcers. Money is 
a convenient illustration, because money provides access to food, drink, 
shelter, entertainment, and thus becomes a generalized reinforcer for a 
variety of activities. The so-called needs (need for attention, need for 
affection, need for approval ) lead to the kinds of persistent behavior best 
understood as a consequence of intermittent reinforcement, and the 
kinds of reinforcement sought are the generalized ones implied in the 
words “attention,” “affection,” and “approval.” Language behavior, such 
as calling objects by their correct names, tends to be reinforced by the 
generalized reinforcement from the listeners, .who show in indirect ways 
whether or not they understand and approve.(According to Skinner, even- 

2 *Bersh (1951). 

25 Schoenfeld, Antonitis, and Bersh (1950a); Webb and Nolan (1953). 

26 Keller and Schoenfeld (1950), page 260. 

27 Skinner (1953a), page 77-81. 
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tually generalized reinforcers are effective even though the primary 
reinforcers on which they are based no longer accompany them. 28 This is 
close to the functional autonomy of motives, as proposed by Allport. 29 

OTHER INFLUENCES AFFECTING OPERANT STRENGTH 

In the effort to remain descriptive and positivistic, Skinner has at- 
tempted to avoid the postulating of intermediaries not observed in his 
experiments, and to deal instead with a procedure that he calls functional 
analysis. A functional analysis is concerned with the lawfulness of re- 
lationships and the manner in which these relationships fluctuate under 
specified conditions. We have just reviewed one very important class of 
events in the functional analysis of behavior: reinforcements, as they 
affect operant strength. Two other classes of events are significantly cor- 
related with responses in the problems that Skinner studies. These are 
the classes known as drive and emotion . 

Drive 

Hours of food deprivation are important in determining the rate 
of responding in a Skinner box. The variable actually plotted is exactly 
that: hours of deprivation^ Is it necessary to say anything further? Is it 
important to talk about physiological needs, or hunger? Let us first cite 
some typical experimental results and then return to these questions. 

\ Eight rats learned to press a lever under interval reinforcement. They 
practiced daily but received the main portion of their rations on alter- 
nate days. The correlation between responding and hours of depriva- 
tion showed up in high and low rates on successive days. Two subgroups 
were created, matched according to their response rates during condition- 
ing. They were now extinguished on alternative days, one group when 
hunger was high, the other when hunger was low. The high hunger 
group yielded nearly double the responses of the low group on successive 
daily periods, although the two extinction curves showed similar curva- 
ture. 30 

Does deprivation affect the amount of contribution of each reinforce- 
ment, or does it merely affect the rate of responding during extinction? 
In a test of these relationships, rats were reinforced after different 
amounts of food deprivation, from one-half hour to 47 hours. Various sub- 

28 Skinner (1953a), page 81. 

29 Allport (1937). 

80 Skinner (1950), pages 201-202. 
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groups of animals received 1, 10, and 30 reinforcements, respectively. 
Then the strength of conditioning was tested by resistance to extinction, 
at a common level of deprivation (23 hours). Resistance to extinction 
was throughout correlated with the number of prior reinforcements, but 
not with the level of deprivation within reinforcement. 31 

These two studies suffice to illustrate the kinds of relationships be- 
tween drive and operant conditioning that are open to investigation. In 
a narrative account of the results we tend to move back and forth be- 
tween describing the rats as “deprived of food" or as ‘hungry." The 
two expressions are operationally equivalent, though when we assign the 
results to ‘hunger" instead of to ‘hours without food" we tend to imply 
a theory. This leads us back to the problem of the status of “drive” as a 
concept. 

^Skinner is quite clear that he means by “drive" merely a set of op- 
erations (such as the withholding of food for a certain number of hours), 
which operations have an effect upon rate of responding. He is interested 
in the lawfulness of these effects under various circumstances. He objects 
to most of the current uses of “drive” by supporting the following asser- 
tions: 32 

A drive is not a stimulus. 

A drive is not a physiological state. 

A drive is not a psychic state. 

A drive is not simply a state of strength. 

By these negatives he makes it clear that he does not accept the 
stimuli from stomach contractions as the prototype of drives, nor does he 
accept physiological needs, or pleasures or pains, or desires or wishes. 
For the purposes of the systematic study of behavior, the word “drive” is 
used only to acknowledge certain classes of operation which affect be- 
havior in ways other than the ways by which reinforcement affects it. He 
does not believe inference to an intermediary (intervening variable or 
hypothetical construct) to be necessary in order to carry out the func- 
tional analysis. 

Emotion and Punishment 

Just as drives are inappropriately classified as stimuli, so, according 
to Skinner, emotions are often unwisely classified as responses. Weeping 
at a bruised shin or over the loss of a game is ordinarily said to be an 
emotional response, but weeping because of a cinder in the eye is not. 

31 Stras^urger. ' 1950). 

32 Sk :ii-( j v !01 5a pages 144-146. 
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This way of treating emotion is rejected in favor of referring it to a set 
of operations, in many ways like drive. Its importance arises because of 
the accompanying or ensuing changes in response. 

The effects of punishment are often emotional by this definition. 
Punishment might act dynamically as negative conditioning, as though 
reinforcement were being subtracted, but in fact it does not serve in this 
way. If at the beginning of extinction the rat is slapped on its feet when 
it presses the lever, its rate of responding is depressed, but it eventually 
recovers. The total responses to extinction have not been reduced by 
the punishment, as they should be if the punishment were negative con- 
ditioning. 33 Such a temporary effect upon rate but not upon total re- 
sistance to extinction is emotional. There is here some support for Thorn- 
dike s belief that punishments do not act opposite to rewards/ 

Because of the importance of the problem of punishment, Estes’ 
(1944) study, carrying further Skinners earlier explorations, will be re- 
viewed later as an exemplification of Skinners system. 

DISCRIMINATION AND DIFFERENTIATION 

In situations which require more than a change in rate of respond- 
ing, two features stand out. One is the discrimination between stimuli, so 
that a given response may be made to one of a pair of stimuli and not 
to the other member of the pair. A second feature is differentiation of 
response, so that response form or topography is altered or adjusted 
appropriately to the situation. The complexities of behavior can be un- 
derstood according to discriminations and differentiations arranged into 
appropriate chains or patterns. 

Discrimination of Stimuli 

The standard lever-pressing experiment may serve the purposes of 
discriminatory conditioning if the lever-pressing delivers a pellet of food 
in the presence of a positive, supporting stimulus (e.g., a 3 c.p. light) 
and fails to deliver in the absence of this discriminatory stimulus. The 
rat learns to respond only when the light is on, but the light does not 
actually elicit the response. The difference between a discriminatory 
stimulus as an occasion for a response and as eliciting it is clarified by 
an example. I reach for a pencil lying on the desk, but I reach only when 
the pencil is there, and I do not reach for it just because it is there. While 
the pencil does not elicit reaching, it has something to do with my reach- 
88 Skinner (1938), page 154. 
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ing. If it were dark and there were no pencil there, I might grope for it 
because the discriminatory stimuli would be lacking. The pencil does 
not elicit reaching in the light any more than in the dark. It is only the 
occasion for reaching. 34 

When the food is delivered only in the presence of the light, the 
situation is a sort of controlled intermittent reinforcement, for lever- 
pressing is reinforced only part of the time. Eventually response occurs 
almost exclusively when the light is on, so that nearly every response is 
reinforced. The resistance to extinction that is built up at this stage is 
not that of interval reinforcement, but that of ordinary every-trial rein- 
forcement. There are two operants with the same form of response, one 
with the light on, one with the light off; they are selectively reinforced 
and extinguished. There is an interaction called induction by Skinner 
which corresponds to Hulls generalization; whatever happens to one 
operant affects the other to some extent. 

The following statements are offered to clarify this somewhat com- 
plex situation: 35 

1. Responses accumulated in the presence of the positive stimulus 
are available in the presence of the negative stimulus. This is the princi- 
ple of inductive conditioning or generalization. 

2. Responses emitted in the presence of the negative stimulus are no 
longer available under the positive stimulus. This is the principle of in- 
ductive extinction. 

3. Selective reinforcement and extinction increase the number of 
responses available chiefly in the presence of the positive stimulus. This 
is the positive half of the principle of discrimination. 

4. Responses acquired in the presence of the positive stimulus may 
become less readily available under the negative stimulus. This is the 
negative half of the principle of discrimination — the breakdown of in- 
duction. 

The discrimination experiment is always complicated by an addi- 
tional fact of importance. The positive stimulus which acts as the occa- 
sion for the response and for the reinforcement becomes itself a secondary 
reinforcing agent. 

Skinner believes that his arrangement for obtaining discrimination 
is superior to that usually used, in which a choice of responses confronts 
the animal. His rats may either respond to the lever or not respond. They 
do not have to choose between two levers, or between a right and a 

34 Skinner ( 1938), page 178. 

35 Skinner ( 1938), page 229. 
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left turn. He believes that the discrimination box is a "crude instrument” 
for studying the nature of the process, 36 and finds Tolman’s (1938a) de- 
pendence upon choice-point behavior a severe limitation. 37 The objection 
is that in the choice situation no measure of strength is obtained — only a 
measure of relative strength. 

Differentiation of a Response 

Among a number of novel and useful distinctions made by Skinner 
is that between discrimination of stimuli and differentiation of response. 
In operant conditioning, reinforcement can be made contingent on either 
(a) the properties of accompanying stimuli (when the result is a dis- 
crimination), or (b) the properties of the response (when the result is 
a differentiation). 

The rat may be taught, for example, to press the lever with a given 
force, or to hold it down for a given duration, in order for the pellet to 
be delivered. The basic rule of operant conditioning applies: that the 
response must occur before it can be reinforced. Extreme forms or values 
may be obtained by successive approximations. Operant responses are 
emitted with an original range of form or intensity (Hulls response 
oscillation). If only the more extreme values are reinforced, the whole 
distribution shifts, so that higher and higher values may be obtained. 
The training method of successive approximations permits the finally 
learned behavior to be very different from that originally emitted. 

Animal trainers are well versed in this method. As a sort of tour de force 
I have trained a rat to execute an elaborate series of responses suggested by 
recent work on anthropoid apes. The behavior consists of pulling a string to 
obtain a marble from a rack, picking the marble up with the fore-paws, carrying 
it to a tube projecting two inches above the floor of the cage, and dropping it 
inside. Every step in the process had to be worked out through a series of ap- 
proximations, since the component responses were not in the original repertoire 
of the rat. 38 

Response novelty is one of the features rather badly accounted for 
in most stimulus-response systems. Thorndike’s law of assimilation, 
Guthrie’s principle of compromise movements, and Skinner’s successive 
approximations are all attempts to deal with the problem. The concept 
of emitted behavior has advantages over that of elicited behavior. It has 
always been embarrassing for theories of the conditioning type to try 
to find original stimuli to produce the responses called for in singing a 

36 Skinner (1938), page 231. 

37 Skinner ( 1938), page 437. 

33 Skinner (1938), pages 339-340. 
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song or in writing a poem. Such stimuli need not be specified for emitted 
behavior. 


THE SEARCH FOR A UNIT OF BEHAVIOR 

In the foregoing account of some of the kinds of data which Skinner 
collects and of the concepts used in describing them, his more general 
theory has been implied rather than made explicit. Skinner is definitely 
interested in supporting a scientific theory of a specific kind. He favors 
what he calls a purely descriptive system, and a frankly analytical one. 
How he goes about system-making is best illustrated in his discussion 
of the reflex and its laws. 

Early Reliance on the Reflex as the Unit 

Skinner believes that a purely descriptive system, in order to be 
scientific and not a mere “botanizing* ’ of behavior, must choose an ap- 
propriate natural unit of behavior. A real unit will not be an artificial bit 
of behavior taken improperly out of context, nor will it be something too 
complex to enter into orderly relations as a descriptive unit. The level of 
specification found to be necessary is not something a priori, but is the 
level found in experience “marked by the orderliness of dynamic 
changes." 39 Skinner believed, especially at the outset, that his system 
had a merit not possessed by other stimulus-response systems — the dis- 
covery of this appropriate unit: the reflex , as he defined this term. 

By a reflex Skinner means a lawful correlation between a class of 
stimuli and ’ a class of responses. The reflex is not to be identified with 
the spinal reflex, which is defined topographically; its reference is solely 
to behavior and not to anatomy or neurology. The definition rests on a 
dynamic relationship. What a stimulus is, what a response is, what a 
reflex is can be defined only according to the lawful experimental con- 
sequences when the reflex unit is isolated. 

The reflex as an analytical unit is actually obtained in practice . The unit is 
a fact, and its validity and the validity of the laws describing its chancres do not 
depend upon the correctness of analytical assumptions or the :\ of a 

later synthesis of more complex behavior. 40 

{The laws of the reflex . There are static laws, dynamic laws, and laws 
of interaction. 41 

The static laws refer to the quantitative properties of representa- 

39 Skinner (1938), page 40. 

40 Skinner ( 1938), page 29. 

41 Skinner (1938), pages 12-33. 
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tive reflexes. They are reminiscent of the laws of spinal reflexes: the law 
of threshold, the law of latency, the law of magnitude of response as a 
function of intensity, the law of after-discharge, and the law of temporal 
summation. 

The dynamic laws include “classical examples,” which means, pre- 
sumably, that they too derive from the study of spinal reflexes: the law 
of refractory phase, the law of reflex fatigue, the law of facilitation, the 
law of inhibition. In addition there are other examples, logically similar 
but differing in the nature of the operations affecting reflex strength: 
the laws of conditioning, extinction, drive, and emotion^ These have 
already been considered. 

The laws of interaction give an answer to the part-whole problem. 
They are: the law of compatibility, the law of prepotency, the law of 
algebraic summation, the law of blending, the law of spatial summa- 
tion, the law of chaining, and the law of induction. 

There is a family resemblance between the topics covered in Skinners 
laws and in Hull’s postulates, although as they are given by Skinner the 
laws are defining principles and are not stated so as to be used to deduce 
behavior. Skinner’s laws are unlikely to be found false because they are 
worded in such a form that they are true at the level of general observa- 
tion. They provide a framework for further specification and quantifica- 
tion. They enter exposition quite differently from Hull’s postulates. Ac- 
tually, there is little reference to the laws as the data from experiments 
are discussed. 

How laws of this kind define a descriptive system and lead to ex- 
periments might be illustrated by a parallel example from plant growth. 
Preliminary observation would soon establish the fact that growth varies 
with the amount of water in the soil, the range of temperature, and the 
amount of sunlight. Before proceeding to further experimental work it 
would be possible to state a series of growth laws: one law relating 
growth to moisture, another to temperature, another to illumination, 
together with some presumptive laws of their interaction. For example, 
it is plausible that with more moisture in the soil plant could thrive 
at somewhat higher room temperatures. An program might 

easily be motivated by the search for the precis^^^pfof these laws. One 
can imagine the greenhouse with thermostats* i|§pKme controls^ shades, 
and sunlamps. The laws of growth could be worked out according to 
these and similar variables without any regard for the physiology of cells 
or for the chemistry of growth hormones . The observations would, in fact, 
be governed largely by the descriptive system implied in tfieHaws, but 
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their aim is not to prove or disprove these laws, for as far as it goes each 
law merely describes experimental findings. This way of investigation is 
essentially what Skinner has proposed and what he has succeeded in 
doing. There is this difference, that the isolation of a reflex to be studied 
and the selection of units of measurement is a more perplexing problem 
than defining plant growth. 

The Functional Unit Studied Is Not Necessarily the 

“Atom” of Behavior 

While Skinner continues to search for the appropriate unit of analy- 
sis, the basic datum, he appears no longer to be confident that he has 
found the unit either in the reflex or in the measured operant. Three 
terms enter into the characterization of a conditioned operant: the dis- 
criminative stimulus that is the occasion for the response, the response, 
and the reinforcement. The “same” response is not the "same” operant 
if correlated with different stimuli or with different reinforcers. The oper- 
ant under study is perhaps not a "single” operant J 

A somewhat modified suggestion as to an "element” of behavior 
emerges in the following passage which follows a discussion of the prob- 
lem of transfer: 

This leads us to identify the element rather than the response as the unit of 
behavior. It is a sort of behavioral atom, which may never appear by itself upon 
any single occasion but is the essential ingredient or component of all instances. 
The reinforcement of a response increases the probability of all responses con- 
taining the same elements. 42 

We lack adequate tools to deal with the continuity of behavior or with the 
interaction among operants attributable to common atomic units. The operant 
represents a valid level of analysis, however, because the properties which define 
a response are observable data. A given set of properties may be given a functional 
unity. Although methods must eventually be developed which will not emphasize 
units at this level , they are not necessary to our understanding of the principal 
dynamic properties of behavior. 43 

These quotations suggest that Skinner believes we are still at a very 
preliminary stage in die development of behavioral science. Perhaps his 
objection to theory (as theory is used by others) arises because of his 
strong conviction that we need to discard old concepts and make a fresh 
start. We are not far en&gh along this new road to begin the more de- 
tailed analysis called for in matured theory. 

42 Skinner (1953a), page 94. 

43 Skinner (1953a), page 95. Emphasis added. 
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Chained Responses as Illustrative of Functional Units 

What is the practical meaning of assigning “functional unity” to a 
given set of properties? Some light can be thrown on this by an analysis 
of chaining, for actual behavior as we know it comes in sequences larger 
than those of bar-pressing or spot-pecking. 

Even the bar-pressing operant is in reality a chain: 

The use of a chain cannot be avoided in operant behavior because the very 
act of reinforcement implies it. 1 2 3 4 * / 

In their discussion of chaining, Keller and Schoenfeld give a lucid 
account of the problem of determining the degree of analysis desirable 
in order to remain at the level of experimental specification . 45 They be- 
gin with six distinct operants describing what the rat actually does in 
a Skinner box: 


Operant 

Number 

Discriminative 

Stimulus 

Response of the Rat 

1 

Bar-location 

Approach of rat to front of box 

2 

. . . Visual bar 

Rising on hind legs; placing paws on bar 

3 

. . . Tactual bar 

Pressing of bar, thus activating food-magazine 

4 

. . . Apparatus noise 

Lowering foreparts to food-tray 

5 

. . . Visual pellet 

Seizing of pellet by teeth and paws 

6 

. . Pellet-in-mouth 

Chewing of pellet 


It would be possible, as they suggest, to lengthen this list, includ- 
ing the several responses making up the approach from the door to the 
bar-location, and the several ingestion responses that follow upon the 
chewing of food. Instead, however, they find it equally legitimate to re- 
duce the list, as follows: 


Operant Discriminative Response of 

Number Stimulus the Rat 

1 Visual bar Rising 

2 Tactual bar Pressing 

3 Apparatus noise Lowering 

4 Visual pellet Seizing 


The advantage of this reduction lies in the clearly observable and 
regularly recurring sequence or chain of responses, with identifiable (and 

“Skinner (1938), page 43. 

45 Keller and Schoenfeld ( 1950), pages 197-208. 
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controllable) stimuli. In this chain each response produces the dis- 
criminative stimulus for the next response. 

Does the chain operate as a unit? The well-conditioned rat makes 
the transitions so smoothly that it seems to be giving one response, not 
four. Rut the independence of the units of the chain can be tested ex- 
perimentally. 

1. If we eliminate only the stimulus for the final unit of the chain 
(the pellet), as in extinction, the first three links of the chain are gradu- 
ally weakened, but the last is unaffected. The rat will seize and eat a 
pellet exactly as before. 

2. If we now eliminate both the third and fourth links in the chain 
(the apparatus noise as well as the pellet), and carry out extinction, we 
can find out more about the chain. Reintroducing the noise after extinc- 
tion is well along again reinforces bar-pressing. Hence, during condi- 
tioning, the third link had become a secondary reinforcer. Furthermore, 
the extinction of the preceding two links in the chain did not extinguish 
the reinforcing properties of die third link. 

This kind of functional and experimental study isolates units of the 
chain which preserve some independence in the whole. These units are 
part of a chain, and their distinctiveness as units is not entirely arbi- 
trary. It would be possible to record the separate responses, and not only 
the final one. It is suggested not only that future investigations will un- 
doubtedly employ such procedures, but that exploratory attempts have 
already been made. 46 j 

The unit appropriate for experimental study turns out, in fact, to 
have a measure of arbitrariness about it. If it is a matter of convenience 
whether to measure one response of the chain, or four responses, or many 
more than that, there is evidently some selectivity exercised by the experi- 
menter. This is always true: all description is partial description. The 
nature of events is such that description of them cannot be exhaustive. 
The functional unit, for Skinner, as for other experimenters, is not given 
completely by the processes under investigation. Sometimes the functional 
unit is a simple response, sometimes a complex act, sometimes a rate of 
responding. The unit no longer has the clean dimensions of a correlation 
between a class of stimuli and a class of responses as implied in the origi- 
nal concept of a reflex. The atom of behavior proves to be evasive. ■ 

46 Keller and Schoenfeld (1950), page 203. The attempts cited are those of Arnold 
(1947). 
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* EXTENSION AND APPLICATION OF OPERANT 
CONDITIONING 

The generality of Skinner s approach to problems of behavior is 
suggested by the titles chosen for his books: The behavior of organisms 
(1938) and Science and human behavior (1953). Neither title betrays 
that the precise data derive largely from experiments on rats and pigeons. 
Although the data are now being supplemented by experiments with 
human subjects, both children and adults, normal and disturbed, in these 
books the extension of the theory is by analogy rather than by experimen- 
tation. From a scientific point of view (using the word "science” as 
Skinner uses it) the extensions merely explore the kinds of variables 
which may eventually be brought under scientific control. The ultimate 
possibilities, as Skinner envisages them, are boundless: one chapter he 
includes is entitled "Designing a culture.” 47 We are not here concerned 
with the larger political and ethical aspects of the society he presents. 
Instead, our problem is to see the steps by which laboratory learning is 
extended to encompass learning in a larger context/ For our purposes 
we may therefore select some typical problem areas and see how they 
are handled. Three such problem areas are schoolroom learning, verbal 
behavior, and psychotherapy. 

Schoolroom Learning 

Skinner has announced, and embarked upon, a program of investiga- 
tion designed to increase the efficiency of teaching of arithmetic, read- 
ing, or spelling, by using a mechanical device expected to prove far 
superior to the usual teacher. 48 One form of device presents number 
combinations for' the teaching of addition. The child punches the correct 
answer in a kind of adding-machine keyboard; if the answer is correct, 
"reinforcement” occurs by having the problem move on. The machine 
has two advantages: (1) the timing of the reinforcement is prompt, thus 
embodying our knowledge of favorable arrangements for reinforcement, 
and (2) the order of problems is determinedly anfexact understanding 
of generalization, interference, and the need for spaced review. No 
teach er can b e as skilled a reinf orcer as the machine, for the teacher can- 
not be with every child at once, commending proper responses and cor- 
recting erroneous ones. Furthermore, the teacher cannot be as skilled 

47 Skinners novel Walden Two (1948) is a sketch of a modem Utopia portraying 
the possibility of a scientific culture. 

46 Skinner (1954). 
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in determining the proper order of presentation, We can learn about 
schedules of reinforcement from precise experiments and then build them 
into our machines. Because some schedules yield far more in the way of 
stability of response than other schedules, we can hope to save a great 
deal of time in the teaching of our children. 

While as described here, the new technique can serve only to teach 
a "skill/' it must be acknowledged that a great deal of time and effort 
goes into such teaching, and that failures entail great social cost. If 
the new methods prove promising, they deserve a try, though their limita- 
tions must of course be understood. Schoolroom learning is a social ex- 
perience as well as an opportunity to learn the multiplication tables. No 
new machine will make the teacher superfluous! 

( Verbal Behavior 

Language most clearly distinguishes human behavior from that of 
other mammals. Knowledge of how we acquire language, and how we 
use it, is essential to an understanding of human learning/ Skinner has 
long been interested in verbal behavior/- As early as 1936 he produced a 
phonograph record composed of chance groupings of speech sound, 
which because they were chance were inherently meaningless. The rec- 
ord, called a "verbal summator," was used to study the words "read into" 
sounds by the listener. It was a kind of projective technique, similar in 
the auditory field to the ink-blots used in the visual field. 40 Within the 
next few years Skinner reported studies of word association, alliteration, 
and other kinds of sound patterning. 50 His William James Lectures, de- 
livered at Harvard University in 1948, are to appear eventually in revised 
form as a volume entitled Verbal behavior . We can see that verbal be- 
havior has been taken seriously as an empirical problem. 

x The main point of the analysis is that speech sounds are emitted 
(and reinforced) as any other bits of behavior. 51 Some speech utter- 
ances make demands upon the hearer and get reinforced as the hearer 
complies. This function (called the "mand" function) appears early 
in the language behavior of the child. A second function is concerned 
largely with naming (the "tact" function). The tact function leads to the 
richness and versatility of language. Its reinforcement by the hearer is 

49 Skinner (1936c); Shakow and Rosenzweig (1940). 

50 Skinner (1937), Cook and Skinner ( 1939), Skinner (1939) (1941). 

51 The best available treatment at the time of writing is the section on verbal be- 
havior in Keller and Schoenfeld ( 1950), pages 376-399, on which the following brief 
digest is based. A few suggestions and illustrations are to be found in Skinner ( 1953a), 
pages 210-216. 
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more general than mand reinforcement. The reciprocal relationship be- 
tween the speaker and the hearer, involving mutual reinforcement, is com- 
plex, but subject to straightforward analysis. The analysis calls for no , 
new principles, however, beyond those familiar in operant conditioning, j 

Psychotherapy 52 

The need for psychotherapy is said to result from the by-products 
of excessive control by other people (especially through punishment), 
whereby the individual is either incapacitated or rendered dangerous 
to himself or to others. Excessive control has by-products that are either 
emotional (fear, anxiety, anger or rage, depression) or revealed in oper- 
ant behavior ( drug addiction, excessively vigorous or restrained behavior, 
defective discrimination of stimuli, defective self-knowledge, aversive 
self-stimulation). 

The patient turns to the therapist because any relief (or promise of 
relief) is positively reinforcing. Psychoanalytic therapy can be character- 
ized most simply, according to Skinner, as follows: the therapist consti- 
tutes himself a non-punishing audience. Under these circumstances, re- 
sponses repressed by punishment tend to return. Forgotten experiences 
may be recovered, the patient may act aggressively (or at least verbalize 
aggressive impulses), the patient may exhibit strong emotion. The appear- 
ance of previously punished behavior in the presence of the non-punishing 
therapist makes possible the extinction of some of the effects of punish- 
ment. This Skinner believes to be the principal result of such therapy. 

This plausible account calls for no “explanatory fictions” — no id, ego, 
or superego, inhabiting a psychic or mental world. 

What is “wrong” with the individual who displays these by-products of 
punishment is easily stated. A particular personal history has produced an organ- 
ism whose behavior is disadvantageous or dangerous. In what sense it is disad- 
vantageous or dangerous must be specified in each case by noting the conse- 
quences both to the individual himself and to others. The task of the therapist is 
to supplement a personal history in such a way that behavior no longer has these 
characteristics . 53 

i Psychotherapy rests on the direct investigation and redirection of 
behavior itself. Behavior is itself the subject matter of therapy; the be- 
havior is not the symptom of some other subject matter, J 

These conjectures are based on analogies between human behavior 
and experimental studies largely on lower forms. More recently, how- 
ever, the methods have been extended to the study of human patients 

52 Skinner (1953a), pages 359-383. 

53 Skinner (1953a), page 372. 
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hospitalized with psychoses. Preliminary results show that vending ma- 
chines with candy or cigarettes or pictures as reinforcing stimuli can 
sustain operant behavior over long periods of experimentation lasting 
an hour a day. It is supposed that results will be even more satisfactory 
with food reinforcement, and this is being tried. The therapeutic impli- 
cations thus far have not been of primary concern, but periods of psy- 
chotic activity within the experimental hour appear reduced in frequency 
and duration, at least for some patients. 5 ? 

The foregoing discussion of teaching, verbal behavior, and psycho- 
therapy suffices to show the flavor of Skinner s theory as applied to a 
broader range of problems. The steps from the present knowledge of 
behavior to these broader fields are largely matters of technique and 
engineering, rather than of the discovery of new scientific principles. 
Nothing new in the way of dynamic principles is expected to emerge as 
the broader subject matter is investigated. 


'''■ — ■’fHE CONSEQUENCES OF PUNISHMENT AS 
ILLUSTRATIVE OF EXPERIMENTS~WITH OPERANT 

BEHAVIOR ' 


Some of the suggestions about the action of punishment which Skinner 
had earlier made were more extensively studied by Estes ( 1944) under his 
direction. Estes’ work serves as a convenient example of the experimental 
approach to a problem by an investigator adopting Skinner’s general point 
of view. Estes chose a more conventional terminology than Skinner, avoid- 
ing words like "operant” or “reflex” or "emitted,” but the experiments could 
have been described in Skinner’s words. It is of some interest that Estes 
found the more usual vocabulary satisfactorily precise for his purposes. 

On the assumption that punishment does have effects, the two chief 
possibilities are that it weakens the habit or that it merely suppresses 
response. If the habit is weakened, punishment acts as a negative rein- 
forcement, or as an agent hastening extinction. If the response is merely 
suppressed, the response has not been eliminated from the organism’s 
repertory. Skinner’s preliminary experiments favored the latter alternative, 
that punishment suppresses the rate of responding without eliminating 
responses from the total of responses to appear in extinction. Estes points 
out that the alternatives are genuine ones in the light of clinical experience. 
It is familiar that behavior may be repressed, and not overtly expressed, 
although the tendencies continue to exist at considerable strength. 

54 Skinner, Solomon, and others (1954) 
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Punishment Does Not Act as a Negative Reinforcement 

In agreement with Skinner (and the later Thorndike), Estes finds 
that punishment does not lead to a reduction in the total number of 
responses given during extinction, even though there is temporary sup- 
pression of response following the punishment. jSome characteristic find- 
ings are shown in Figure 13. 
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FIGURE 13. Evidence tbat punishment depresses rate of responding without 
decreasing the total responses within extinction. Responses of two groups of 8 rats 
each are compared. While the groups were not alike in their response frequency prior 
to extinction, their rates have been equated, and other comparisons are based upon 
this adjustment. Within the first extinction period one group of rats received an 
average of 9 punishments (electric shocks while pressing the lever). This punish- 
ment reduced the number of responses on the first day, and perhaps on the second, 
but by the third day there was a compensatory increase in the number of responses 
so that the total responses of the group punished on the first day and of the group 
not punished are the same over the three days of extinction. Data from Estes (1944), 
Experiment A,- corrected for differences prior to extinction. 

fOn the basis of additional experiments in the series, Estes concludes 
that the total number of elicitations of the response necessary for extinc- 
tion may be reduced somewhat by punishment, but the time required for 
complete extinction of tfye response will not be affected by the p unis hment 
More generally, the conclusion is reached that a response cannot be 
eliminated from an organisms repertory by the action of punishment alone. 
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Punishment More Importantly Associated with Stimuli 
Than with Responses 

If punishment were a reinforcing agent similar to reward, it would 
act on responses, in accordance with Skinner's principle of Type R con- 
ditioning, That its more significant correlation is with discriminative stimuli 
is pointed out by Estes on the basis of several kinds of evidence. 

In one experiment 55 the animals of the experimental group were 
shocked at intervals of approximately 30 seconds, care being taken not 
to shock them during or immediately after a response to the lever. Re- 
covery from the effects of punishment follows the same course (i.e., the 
same mathematical function fits the course) as the course following punish- 
ment of the response itself. It is argued therefore that the effect must be 
due to the contiguity of the disturbing stimuli with stimuli in the box which 
normally act as occasions for lever pressing. 

This conjecture was tested in another experiment 56 in which the rat 
was left in the box for an adaptation period following punishment. During 
this time the lever was withdrawn, so that response to it was impossible. 
The effects of the short period of severe punishment were almost com- 
pletely dispelled, confirming the interpretation that punishment was re- 
lated to the stimulating situation rather than specifically to the response. 

Intermittent Punishment More Effective than Punishment 

at Every Occurrence 

If punishment is delivered every time the response is made, the rate 
of responding is seriously depressed. While it is not depressed as much 
if punishment is given only occasionally, the effects of punishment persist 
longer in the latter case. 57 This is shown in two experiments, in one of 
which the test is made by simple extinction, the other in which the ex- 
tinction follows an adaptation period. The results are most striking fol- 
lowing adaptation. Adaptation does not bring recovery after intermittent 
punishment as it does after every-trial punishment. The results are shown 
in Figure 14. 

! Estes offers some conjectures in explanation of these results. He ac- 
cepts a twofold theory of the effect of punishment. One principle, already 
referred to, is that punishment creates an emotional state which depresses 

55 Estes (1944), Experiment I. 

56 Estes ( 1944), Experiment J. 

57 Estes (1944), Experiments E and K. The average number of punishments re- 
ceived by each rat turns out to be about the same for both groups. Hence it is 
legitimate to compare the long-time effects of the two types of punishment. 
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responses while the state is aroused. This consequence of punishment does 
not lower the resistance to extinction. It is recovered from by adaptation 
or during subsequent extinction trials. The second principle is one con- 
gruent with Hull’s interpretation of punishment, that is, an interpretation 
of the cessation of shock as reinforcing. Estes believes that the withdrawal 
responses are positively conditioned by the termination of the shock de- 



EXTINCTION EXTINCTION 

(PUNISHMENT OAY) . ( AFTER ADAPTATION) 


FIGURE 14. Evidence that intermittent punishment is more effective than regular 
punishment. The mean numbers of responses of three groups are shown, a control 
group extinguished in the usual manner, a group punished for responding whenever 
response occurred, and a group punished periodically, for responses occurring within 
every fourth minute of the 40-minute extinction period. Following the experimental 
day there were two adaptation days, when the animals were placed in the boxes 
with the levers retracted. The results for extinction are for the first day with the 
exposed lever, after these adaptation days. The adaptation period has almost com- 
pletely eradicated the results of regular punishment, but the consequences of periodic 
punishment persist. Each group consisted of 6 rats. Data from Estes (1944), Ex- 
periment K, corrected for differences prior to extinction. 

Iivered by the lever. He believes, further, that such withdrawal responses 
are conditioned more strongly through intermittent reinforcement than 
through ordinary continuous reinforcement, consistent with Skinners find- 
ings for conditioning in general. 

Punishment in the Practical Control of Behavior 

Estes suggests several practical implications of his study. 

The main finding is that a response cannot be eliminated from the 
organism s repertory more rapidly with the aid of punishment than with- 


SKINNER’S OPERANT CONDITIONING 113 

out it. Permanent weakening comes about only by non-reinforced elicita- 
tion — and this weakening process may be prevented if punishment sup- 
presses the response. This is in line with clinical findings about forms of 
aggressive or hostile responses which have come under parental or other 
social punishment. They are not eliminated until they can be brought to 
free expression, when the behavior can be appropriately redirected. 

Another disadvantage of punishment lies in the concomitant emotional 
state which is aroused. This state will suppress other responses in addition 
to the one punished. The lack of specificity of its target is a weakness of 
punishment. 

On the positive side, there are several "rules” which serve to show 
when and how punishment may be useful: 

1. Punishment may be used to hold a response at low strength. Under 
these circumstances, punishment has to continue indefinitely, as it does 
not eliminate the response. The continuing punishment is equally or more 
effective if administered occasionally than if it is given every time the 
objectionable behavior occurs. 

2. It is possible to take advantage of the period of suppressed response 
following punishment in order to strengthen some other response by rein- 
forcement. 

3. It is important that punishment be given in the presence of the 
discriminative cues for the response. Delayed punishment is likely to prove 
ineffective because it is given at a time when the discriminative stimuli 
leading to the undesired conduct are absent. 4 

The series of experiments by Estes, with their interpretations, show 
how, within the experimental and theoretical framework of Skinners 
system, it is possible to experiment upon problems genuinely relevant to 
the practical control of learning situations. The challenging difficulty 
remains of finding appropriate ways to test the implications in a social 
context. 


ESTIMATE OF SKINNER'S SYSTEM 

Skinners Position on Typical Problems of Learning ' 

Now that we have available general textbooks written from the stand- 
point of Skinner s position, it is possible to make a summary in relation to 
problems that do not fit directly into his conceptual scheme. 

1. Capacity . In a descriptive system, it is to be expected that the 
laws will contain empirical constants differing for various species and 
for different members of each species. The eating rate, for example, can- 
not be expected to remain the same for young and old animals, and for 
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animals unlike in their food preferences. Because lawfulness rather than 
laws is what the systematist insists upon, differences in capacity are not 
of central importance. There is no suggestion that at higher capacity levels 
the laws are essentially any different; verbal behavior in man, for example, 
is said to conform to the general principles of operant behavior. 

Skinner argues against the usefulness of a “trait” description in study- 
ing individual differences. A trait-name does not refer to any unit of be- 
havior suitable for study through the functional analysis that he recom- 
mends. 58 

2. Practice. Something like a simple law of exercise (practice under 
conditions of contiguity of stimulus and response) is accepted for Type S 
conditioning. The conditioning that occurs under Type R depends upon 
repeated reinforcement. The possibility is favored that maximum rein- 
forcement may occur in a single trial for the single operant, but the single 
operant is not achieved experimentally. In the usual case, the accumulation 
of strength with repeated reinforcement depends upon a population of 
discriminated stimuli and the related operants, more or less after the manner 
of Guthrie's explanation of the interpretation of the acquisition of skill 
with practice. 

The effect of a single reinforcement is greatly enhanced by the 
secondary reinforcement deriving from it. Hence there is no direct cor- 
respondence between the number of responses yielded in extinction and 
the number of responses reinforced. The schedule of reinforcements is 
very important, with the greatest yields in the way of resistance to ex- 
tinction coming from intermittent reinforcement (interval or ratio rein- 
forcement). 

3. Motivation . In agreement with Thorndike, reward is found to in- 
crease operant strength, while punishment has no corresponding weak- 
ening influence. Although there are several ways in which punishment 
enters, one is to create the variable called emotion, which reduces the rate 
of responding without reducing the number of responses yielded in ex- 
tinction. 

Drive level (determined by degree of deprivation) affects the rate 
of responding, but its level does not influence the effectiveness of a rein- 
forcing stimulus in producing resistance to extinction. There is an unre- 
solved problem of a lower limit here, for there must be enough drive 
to maintain responding and to make the reinforcer relevant. 59 

58 Skinner ( 1953a), pages 194-203. 

88 We shall meet this same problem in the next chapter, when HulFs interpretation of 
reinforcement is under consideration. 
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4. Understanding. The word “insight” does not occur in the indexes 
of Skinners books. Keller and Schoenfeld identify insight with rapid learn- 
ing, and then explain it on the basis of (1) similarity of the present prob- 
lem to one earlier solved, or (2) the simplicity of the problem. 60 

Because solving a problem means only the appearance of a solution in 
the form of a response, Skinner believes that the technique of problem- 
solving is merely that of manipulating variables which may lead to the 
emission of the response. No new factor of originality is involved. This 
makes it possible to teach a child to think. 61 

5. Transfer. Skinner prefers to use the term “induction” for what is 
commonly called generalization in the literature of conditioning. Such 
induction is the basis of transfer. The reinforcement of a response in- 
creases the probability of all responses containing the same elements. 
Similarly, the control acquired by a stimulus is shared by other stimuli 
with common properties. This interpretation of transfer is, in spirit, very 
similar to Thorndike’s. 

6. Forgetting. There is no special theory of forgetting proposed, al- 
though the distinction between extinction and forgetting is carefully main- 
tained. Conditioning is long remembered. A pigeon showed sizable ex- 
tinction curves six years after the response had been last reinforced. 
Extinction, too, is retained. Spontaneous recovery does not mean that ex- 
tinction is forgotten, for successive extinctions show the results of earlier 
ones. 

True forgetting appears to be a slow process of decay with time. 
Other kinds of response decrement are better subsumed under extinc- 
tion or interference by incompatible responses. The latter interpretation 
applies to so-called Freudian forgetting./ 

The Reflex No Longer a Satisfactory Unit 

When he first committed himself to the reflex as his unit (Skinner, 
1931), he referred to the correlation between a class of stimuli and a 
class of responses. If this correlation was lawful, he believed the term 
“reflex” to be appropriate. A case could be made for his position, but the 
consequence was no more than to discover as his unit what Thorndike 
had earlier named the “connection” or “bond.” Thorndike, too, accepted 
the correlation between situations and responses as the basis for his unit. 

When Skinner developed the notion of emitted behavior, his earlier 
unit was no longer appropriate. The discriminated operant, which is the 

60 Keller and Schoenfeld (1950), page 60. 

61 Skinner ( 1953a), pages 252-256. 
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basis of experiment and theory in the newer writing, he early referred to 
as a pseudo-reflex . 62 He pointed out that it would be a great mistake to 
suppose that the correlation established between a discriminated stimulus 
and an operant followed the laws of the reflex. Thus the systematic cor- 
relation between stimuli and responses, a legitimate extension of the" 
historical term “reflex,” was no longer appropriate in dealing with operants,.. 
The terms which enter a reflex relationship should be those of “stimulus” 
and “response.” Skinner’s concern has appropriately shifted to mere law- 
fulness of behavior. The reflex proper (the basis for respondent behavior) 
is limited to autonomic and postural responses; the operant can just as 
well run under its own name, and not be confused with these reflexes. 
Instead of insisting on the analysis into reflex units, Skinner now appears 
somewhat less sure about the ultimate unit, and is temporarily satisfied 
with a functional analysis at the level of experimental convenience. 

Co-ordination with Other Systematic Viewpoints 

Skinner’s “fresh start” approach to psychology has made it difficult 
for him to use the data collected by others, and, on principle, he rejects their 
concepts. His role in reference to the theories of others — insofar as he 
has paid any attention to their claims — has been chiefly that of a trenchant 
critic. /He has felt no responsibility for the task of inter-investigator co- 
ordination. In his book Science and human behavior (1953), written 
as a textbook, he used no literature citations, and he mentions by name, 
among writers with some place in learning theory, only Thorndike, Pavlov, 
and Freud. 

That some co-ordination is possible is shown by Keller and Schoenfeld 
( 1950), who made a serious effort to bridge the gap between the data col- 
lected by Skinner and his followers and by others interested in learning. 
They were able to find much that they could use, even though they re- 
mained within the conceptual structure of Skinner’s system. 

The almost exclusive preoccupation with animal experiments (except 
for the studies of verbal behavior, and work with abnormal human sub- 
jects thus far only sketchily reported) raises some interesting questions. 
Presumably a functional, descriptive psychology of this kind must ul- 
timately design experiments at the level at which scientific statements 
are to be made. As Verplanck points out, it is somewhat anomalous that 
a systematist who refuses to predict what a rat or pigeon will do — -because 
such prediction does not belong in a scientific study of behavior — is willing 

Skinner (1935). 
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to make confident assertions about the most complex forms of human be- 
havior, economic, political, religious. 63 The intervening steps appear to be 
missing, and the theory makes no formal provision for them. 

The Anti-Theory Argument 

Skinner’s espousal of a descriptive, positivistic position and his op- 
position to a hypothetico-deductive system have frequently been reasserted, 
nowhere more clearly than in his presidential address before the Mid- 
western Psychological Association, entitled: “Are theories of learning 
necessary?” 64 

The theories he opposes are of at least two kinds. One kind is that 
which would find its explanations at a different level of discourse, using, 
for example, a physiological explanation for psychological events. There 
is no real objection to such a theory, unless it is overly insisted upon. The 
independence of the behavioral subject matter must be permitted, and 
explanatory mechanisms, especially explanatory fictions, are to be avoided. 
He challenges another kind of theory, commonly called h ypo thetico-deduc-\ 
tive, which stresses the formulation of hypotheses and their testing. This 
kind of theory Skinner objects to as a futile and extravagant exercise' The 
argument can be stated simply enough. The end result of scientific investi- 
gation, whether the investigation is descriptive or an effort to confirm a 
theory, is a described functional relationship demonstrated in the data. 
Because the end result is the same, the only, point at issue is really which 
method more efficiently directs the inquiry. Skinner is firmly of the opinion 
that theories are wasteful and misleading.Vfhey usually send the investi- 
gator down the wrong paths, and even if the scientific logic makes them 
self-correcting, the paths back are strewn with discarded theories. Once 
the data are in order, the theories tend to drop out. “If the theories have 
played no part in the design of the experiment, we need not be sorry to see 
them go.” 

Criticism of existing hypothetico-deductive systems in psychology is 
easy. This does not mean that their scientific logic is wrong. It may mean 
that some particular efforts have been premature or misdirected. Perhaps 
Skinner is right in criticizing unripe theorizing, but surely he is wrong if 
he means that we should return to a Baconian conception of science. The 
power achieved by other sciences through theoretical formulation is too 
convincing for such formulation not to serve at least as a long-range goal 

6S Verplanck ( 1954), page 311. 

64 Skinner (1950). 
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for those trying to create a science of behavior. That Skinner’s approach is 
a fruitful one need not lead to the conclusion that other approaches are 
futile. 

One of the next steps for a science, after lawful relationships have 
been discovered and appreciable amounts of data collected, is to represent 
the data in a kind of shorthand summary, using a minimal number of terms, 
and mathematical representation. This kind of systematic science Skinner 
can support, because it does not call for inferences alien to the data of 
observation. But he says of it: 


We do not seem to be ready for a theory in this sense. At the moment we 
make little use of empirical, let alone lational, equations. A few of the present 
curves could have been fairly closely fitted. But the most elementary preliminary 
survey shows that there are many relevant variables, and until their importance 
has been experimentally determined, an equation that allows for them will have 
so many arbitrary constants that a good fit will be a matter of course and cause 
for very little satisfaction. 65 

It is not surprising that a recent critic has found it appropriate to 
remark that Skinner s is "a highly formal, but not a highly formalized, 
theory” 66 


f Success of the System 


Two major achievements give the system a firm place in contemporary 
psychology. 

At the level of data, the most novel and interesting data are those 
deriving from the various schedules of intermittent reinforcement. Only 
the sketchiest of experiments have been done by other workers in this 
fascinating field, 67 and the kind of operant behavior studied by Skinner 
has lent itself beautifully to this kind of investigation. It is a far cry from 
pointing out that rewards influence learning to achieving the kind of con- 
trol whereby several thousands of responses are yielded at a uniform rate 
during extinction following a particular pattern of rewarding. 

At the level of practical behavior the most striking results have been 
obtained in animal training. While these have not been of very great 
theoretical interest, it is not wise to dismiss them as merely signs of 
cleverness on the part of the trainers. These practical demonstrations serve 
as important empirical supports for certain aspects of the system — a kind 
of support very much needed for learning theories, and notably lacking 
thus far. No other learning theorist has been able to train an animal before 


65 Skinner ( 1950), page 216. 

66 Verplanck ( 1954), page 295. 

67 For a critical review of the literature, see Jenkins and Stanley ( 1950). 
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an audience in a prompt and predictable manner, while at the same time 
epitomizing the principles of his theory. There have long been public 
demonstrations of learned behavior — of conditioned responses, of maze 
learning, of discrimination — but these demonstrations have usually relied 
upon exhibiting the results of earlier training. By contrast, Skinner s pigeons 
can be brought before a class and taught various “stunts” before the eyes 
of the students. The empirical demonstration that learning is under the 
experimenter’s control is important, and if what happens can be described 
in terms of the system, so much the better for the system. 

The practical use of the system is based on the complementary 
principles of control through presenting and withholding food reward for 
a hungry animal. The supplementary principles of stimulus discrimination 
and response differentiation suffice to inaugurate the method of successive 
approximations. Beyond that, all that is needed is the experimenter’s in- 
genuity. It is not necessary to worry about anything precise in the way 
either of experimental data or of correlated principles. From the point 
of view of & theoretical achievement this is really a pretty modest ex- 
tension .of Thorndike’s law of effect. Whether or not a child can be taught 
to write a limerick by the same methods can only be known when the 
limerick gets written. The theory does not propose to predict! 
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Chapter 5 

HULL’S SYSTEMATIC BEHAVIOR 
THEORY 

C LARK L. HULL (1884-1952), greatly impressed by the appearance 
of Pavlovs Conditioned reflexes ( 1927 ) , began thereafter a long series 
of theoretical and experimental studies that in their totality comprise the 
best example of hypothetico-deductive system-making to appear during 
the first half of the century. The system is a behaviorism, and as such falls 
into the family of theories to which Guthrie's and Skinner’s also belong. 
Each of these three systems represents in its own way a fulfillment of the 
behavioristic program originally proposed by Watson. (Like Watson's, 
Hull’s theory is avowedly mechanistic and studiously avoids reference to 
consciousness. Its central concept is habit, and it derives most of its in- 
formation about habit from experiments with conditioned responses. Com- 
plex behavior, furthermore, is derived step by step from what is known 
about more elementary forms of learning. In these respects the theories of 
Watson and Hull are alike, but in other respects Hull's system represents 
a great advance over Watson's. Hull took the detailed findings of con- 
ditioning experiments much more seriously than Watson, who was satis- 
fied to make use of the general paradigm provided by conditioned re- 
sponses. Hull adopted (and adapted) Thorndike's law of effect, whereas 
Watson rejected it. For Watson’s policy of denials and negations, Hull 
substituted a positive program of trying to explain purposes, insights, and 
other phenomena difficult for a behaviorism to encompass. 

THE BASIC ORIENTATION 

Before turning to the more formal system, expressed in postulates, 
corollaries, and theorems, we may do well to examine the bxoad frame- 
work within which the system operates. 
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Intervening Variables and Their Anchorage in Observables 

In the first revision (1929) of his widely used textbook Woodworth 
suggested that we substitute the formula S-O-R for the earlier recom- 
mended S-R formula. The stimulus (S) affects the organism (O), and 
what happens as a consequence, the response (R), depends upon O as 
well as upon S. Hull's system may be thought of as a herculean elabora- 
tion of this S-O-R formula, 1 

In our experiments we measure environmental influences upon the 
organism (the input), and then measure the organism's responses (the 
Dutput ) . These measures provide firm anchorage for our data in the en- 
vironment, where objectivity can be achieved and maintained. Input and 
output are not comprised exclusively of the experimentally studied stimuli 
and responses. Other influences upon the organism can be treated as ex- 
perimental variables, such as prior history of training, deprivation sched- 
ules, injection of drugs, and these influences can be described as objectively 
as stimuli and responses. What goes on within the organism we have to 
Infer, and in the course of making these inferences we postulate certain 
intervening variables or symbolic constructs. If we tie these inferences 
firmly to the input-output terms by way of quantitative mathematical 
statements, we lose nothing in objectivity, and gain something in con- 
venience, understanding, and fertility of deducing new phenomena. This 
is the basic logic of the system. 

Reinforcement the Primary Condition for Habit Formation 

\ln his choice among the three major possibilities (contiguity alone, 
reinforcement alone, or a dual theory), Hull stood firmly for a reinforce- 
ment theory. As early as 1935 he suggested that the Pavlov experiment 
might be considered a special case under Thorndike's law of effect, and in 
1937 he published a formal derivation of Pavlovian conditioning on the 
basis of a reinforcement principle. 2 Although some modifications were 
made in his interpretation of reinforcement, he held to the end to the one 
principle as central to learning^ 

Reinforcement theory of the kind Hull espoused requires, in the 
specification of a primary reinforcing state of affairs, either drive reduction, 
as in need satisfaction, or drive-stimulus reduction, as in the satisfaction 
ofa craving rather than a need. Hull earlier held to the drive-reduction 
theory, most clearly exemplified in escape from a continuing noxious 

1 Spence ( 1952a), page 646. 

*Hull (1955a) (1937). 
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stimulus, such as an electrically charged grid. The activity that terminates 
the noxious stimulus is reinforced because the need to escape injury is 
satisfied in the escape. Later, under the influence of the Miller-Dollard 
theory of drive-stimulus reduction, 3 Hull went over to the drive-stimulus 
reduction theory and abandoned the drive-reduction interpretation of 
reinforcement. The example of escape from a noxious stimulus was equally 
cogent, though now the_reduction in pain (the consequence of stimulation) 
is theoretically the basis for reinforcement instead of the escape from in- 
jury (the underlying need). In primary reinforcement the two events 
(drive and drive-stimulus) are so closely associated that it does not matter 
very much which is assumed to be reduced. But needs may require time 
fpr their satisfaction (as in the time required to digest food) whereas 
incentives (including food) work promptly as reinforcers, more as stimuli 
might be expected to work. Furthermore, as secondary reinforcement 
came into greater and greater prominence in reinforcement theories, the 
stimulus-reduction theory became even more attractive. This follows be- 
cause secondary reinforcement is a function of stimuli rather than of needs. 

Integration of Behavior Sequences Through Anticipatory 

Responses 

Most of the primary behavioral laws in Hulls system were derived 
from either classical or instrumental conditioning (both interpreted as 
illustrations of learning under the control of reinforcement). But the 
appeal of the system rests upon the comprehensive explanation that it 
provides of many phenomena of learning in experiments not classifiable as 
simple conditioning: more complex trial-and-error and discriminatory 
learning, maze learning, rote memorization, tool using, and so on. In order 
to make these further deductions, Hull proposed a number of intermediate 
mechanisms, derivable from the basic laws of his system, but, once derived, 
of very wide applicability. At this point the intent is merely to give some- 
thing of the flavor of these intermediaries; more detailed illustrations will be 
presented later in this chapter. 

Many of the stimuli present at the time the goal is reached are also 
present earlier. These include the stimuli from the drive (what Guthrie 
called maintaining stimuli), environmental stimuli present both earlier 
and during reinforcement, traces from earlier stimuli persisting to the 
goal, as well as stimuli aroused by the animals own movements. Hence 
in repeating a sequence of acts leading to a goal, as in running through a 
maze, there are always enough of these stimuli conditioned to the goal 

3 Miller and Dollard ( 1941). 
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response to elicit fractions of the goal response prior to reaching the goal. 
These fractional antedating goal-responses ( r G ’s ) are very important 
integrators in Hulls system, and he made very ingenious use of them. 
While they were prominent all along in Guthrie’s system, Guthrie never did 
formalize their use as Hull did. 

The fractional anticipatory responses give rise to stimuli ($g). These 
stimuli sometimes serve as surrogates for directing ideas. In that case the 
r G is a “pure-stimulus act,” that is, an act that serves functionally merely 
to produce a stimulus that maintains a steering role in behavior. 4 

These same stimuli (s G ) are very important in secondary reinforce- 
ment, permitting reinforcement originating in food at the end of the maze 
to move back and reinforce turns far from the goal. 5 

Furthermore, it is these same stimuli (s G ) that generate the gradient 
of reinforcement and the habit-family hierarchy , important intermediate 
mechanisms, which we shall meet later on. 

Thus, on the basis of primary behavioral postulates, Hull deduced 
types of mechanism that led to wide generality in his deductions. 

The Requirements for a Quantitative, Deductive System 

Hull knew very clearly what he wanted in the way of a formal system. 
Such a system should begin with adequately defined terms, and then state 
a few (as few as possible) basic postulates. These postulates may either be 
very general empirical findings, and thus independently verifiable, or, if 
not directly testable, they must be subject to indirect verification. Their 
purpose is to relate the fundamental intervening variables by strict logic 
(and quantifiable mathematical equations) to each other and to their 
anchorages in environmental events. These postulates, taken together with 
the definitions, will generate new testable deductions or predictions. These 
are the corollaries and the theorems of the system. 

The progress of science comes about through the experimental testing 
of theorems. When agreement is found, the postulate system generating 
the theorem remains in the running; when disagreement is found, a 
search is made for the faulty postulate, if such there be, and the postulate 
is then revised. Possibly, if no faulty postulate can be found, a new postu- 
late may have to be added. Thus the system is self-correcting, and only 
those features survive that have stood the scrutiny of meticulous experi- 
mental testing. 

Systems of this kind begin as “miniature” ones. That is, they encom- 

^ Hull (1931). 

5 Hull ( 1952a ) , page 14. 
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pass at first only a limited range of data. Even these limited data put the 
early system under strain, and the next steps consist in trying to secure a 
better fit between the theory and these data. Then additional data are 
incorporated through extensions of the system. Sometimes two or more 
“miniature systems” may be combined, as appropriate bridging concepts 
are developed. There may be major shifts in the systematic formulations, 
as when, in physics, Newtonian theory became a special case under 
Einstein’s more general theory. Thus theory construction proceeds by 
successive approximations, and finality in a theory is often a sign that 
the theory is faulty. 

Did Hull succeed in producing a good system, according to these 
criteria? A satisfactory answer may perhaps have to wait for the verdict 
of history. If a generally acceptable theory of learning eventually emerges 
with convincing evidence of its ancestry in Hull’s earlier formulations, he 
will have succeeded. If the direction in which he moved turns out to have 
been a blind alley, and greater success comes from a fresh start in new 
directions, then his success will not have been as great. Even so he will 
have done a profoundly valuable service in showing what a mature system 
might look like, and in insisting that it was not too early to attempt to 
systematize psychology in rigorous fashion. 

THE FINAL SYSTEM 
Earlier Sets of Postulates 

Hull’s theoretical system evolved gradually. He habitually wrote down 
various conjectures and plans for experiments in bound notebooks, twenty- 
five volumes between 1915 and 1951. 6 Many of his later publications were 
foreshadowed in these notes and in the frequent mimeographed memo- 
randa that he circulated. His first paper addressed specifically to the 
subject matter of his final system was in 1929, entitled “A functional inter- 
pretation of the conditioned reflex.” In it he acknowledged his great in- 
debtedness to Pavlov and began on an informal basis to make the kinds of 
deductions he later sought to formalize. Another important influence upon 
his later theorizing arose out of a summer’s teaching at Harvard in 1930, 
when he was invited to lecture on aptitude testing because of the favorable 
reception of his recent book on that topic ( 1928 ) . He there met C. I. Lewis 
and other philosophers, and asked them why philosophers had neglected 
his doctoral dissertation on concept formation. 7 Their answers are no 

e Hull (1951), page 120. 
t Hull (1920). 
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longer available to us, but in any case he added to his library, and read, 
Newton's Principia, and Whitehead and Russells Principia Mathematica. 
These raised his sights as to the kind of theory to which he might aspire. 
The first formal system, using definitions, postulates, and theorems, ap- 
peared in 1935. 8 It was what he called a "miniature system" concerned 
with rote learning, to be elaborately worked over later in the most detailed 
of Hulls formal systematic efforts, the collaborative book entitled Mathe - 
matico-deductive theory of rote learning . 9 / 

I The true ancestor of the final postulate set, however, was contained 
in Hull's presidential address before the American Psychological Associa- 
tion at Dartmouth in 1936. 10 This new 4 miniature system" was concerned 
with adaptive behavior of the kinds reflected in the later postulate sets 
in Principles of behavior (1943), 11 Essentials of behavior (1951), 12 and 
A behavior system ( 1952a ) v The final book was completed only a few 
weeks before Hull's death and appeared posthumously. It was written 
while he was suffering from a heart condition and could work only a few 
hours a day. It is a tribute to his devotion to his work that it was finished at 
all. Some of the changes in the latest version may very well turn out not 
to have been improvements. The system was by no means a finished one, 
and Hull was the first to acknowledge this. The promised books on in- 
dividual differences and on social behavior did not get written, and the 
final book is full of qualifications and reservations. If therefore we base 
the exposition on the final book, and have some faults to find, this need 
not be taken as disparaging to Hull. The book carries the flavor that he 
intended, and the details would have to be filled in and corrected in any 
case. Such was the kind of system Hull envisaged. 

In the pages to follow we shall trace through the postulates and 
corollaries of the final system one at a time, and then summarize the 
system, before turning to some of the deductive consequences of the 
system. There are a total of 17 postulates and 17 corollaries. An introduc- 
tion to these postulates and corollaries necessarily has to be somewhat 
detailed if it is to be at once expository and critical. The effort has been 
made to keep the discussion within bounds by reducing the statement 

8 Hull (1935a). 

8 Hull and others ( 1940). 

10 Hull (1937). 

II For an introduction to the postulate set of 1943, see the earlier edition of this 
book, Hilgard (1948), pages 80-91. 

ri This short book contains, with a few slight differences, the postulates in the final 
book. Essentially the same postulates were listed in an earlier journal article, Hull 
(1950a). 
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of the postulates and corollaries to their essentials through paraphrasing 
and eliminating some of the quantitative mathematical expression. 

K The Sensitive and Responding Organism 

The organism, before it learns, brings with it certain tendencies to 
respond and the necessary sensory equipment to be affected by the en- 
vironment. Hull’s first postulates acknowledge these starting points for a 
stimulus-response theory. 

Postulate I. Unlearned stimulus-response connections 

Organisms at birth possess a hierarchy of need-terminating responses that 
are aroused under conditions of stimulation and drive. The responses activated 
by stimulation under conditions of need are not a random selection of the organ- 
ism’s responses, but are those most likely to terminate the need. 13 

The first postulate recognizes a kind of biological trial and error 
favoring survival, based presumably on evolution. It actually plays little 
role in the system. 

Postulate 2. The molar stimulus trace and its stimulus equivalent 

A. Stimuli impinging upon a receptor give rise to afferent neural impulses. 
These constitute a “self-propagating molar afferent trace impulse,” providing 
the equivalent of an .increasing stimulus, reaching its maximum in about .450 
seconds. 

B. A new phase follows the maximum. This subsiding phase of the molar 
afferent trace impulse gives rise to a second stimulus equivalent. The subsiding 
phase endures longer than the first (recruitment) phase. 

C. The intensity of the molar stimulus trace ( s ) (i.e., the effective stimulus 
in associative learning) is a logarithmic function of the molar stimulus equivalent 
of the trace. 14 

(,The second postulate sets the stage for one of the asserted funda- 
mental conditions for associative learning: the contiguity between stimulus 
and response. As with Guthrie, the contiguity that counts is inside the 
organism — in Hull’s terms, the coincidence between the molar stimulus 
trace and effector activityjflence the second postulate is introduced to 
define the effective intensity of the process initiated when an external 
stimulus impinges upon a receptor. The argument leading to parts A and B 

13 Paraphrased from Hull (1952a), page 5; (1951), page 4; (1943), page 66. 

14 The three parts of this postulate are paraphrased from Hull (1952a), page 5; 
(1951), page 11. The earlier postulate, Hull (1943), page 47, has been considerably 
revised in order to make use of quantitative data from the experiments of Reynolds 
(1945a) and Kimble (1947), How he uses their data can be found in Hull (1952a), 
pages 101-104. Because the equations stated in quantitative form in Hull’s postulates 
are not essential to understanding the postulate set, the essence is here reported in verbal 
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of the postulate is as follows. If stimulus and response must be coincident 
in time for conditioning to occur, the most favorable interval between 
conditioned and unconditioned stimuli must be that at which the maximum 
after-effect of the conditioned stimulus coincides with the beginning of 
the unconditioned response. From the experiments of Kimble and of 
Reynolds this is known to be at an interval of about .450 seconds. Hence 
this interval appears in postulate 2A as the interval of maximum molar 
afferent trace impulse. The form of recruitment and of subsidence is the 
form of the curves obtained by Kimble and Reynolds with various intervals 
between conditioned and unconditioned stimuli. 

The introduction of the logarithmic term in 2C is apparently a con- 
cession to psychophysics. The Weber-Fechner law taught us to expect 
a roughly logarithmic relationship between the sensory response and the 
physical stimulus. This logarithmic transformation may conceivably take 
place within the receptor, within the afferent processes, or within the 
cortex. Hull has rather casually assigned the transformation to the afferent 
process — the molar stimulus trace bearing a'Togarithmic relationship to 
the molar stimulus equivalent of the traced 

Motwationand Reinforcement 

We novrvjome close to the heart of the theory, which is unified around 
the conception that learning takes place only as a consequence of rein- 
forcement. 

Postulate S. Primary reinforcement 

When a response (R) is closely associated with a stimulus trace ($) and 
this stimulus-response conjunction is associated with a rapid decrease in drive- 
produced stimuli (S D ) there will result an increase in the tendency for that 
stimulus trace (s) to evoke that response (R). 

The rapid decrease in the goal stim ulus ($ G ) is also reinforcing. 16 

We may note that the association which gets strengthened is alike 
for both Guthrie and Hull: a contiguous stimulus-response connection. 

15 Placing the logarithmic transformation where he does would perhaps classify Hull 
with those favoring “outer” psychophysics, as contrasted with Fechners “inner” 
psychophysics. See Dodge (1911); Boring (1950), pages 291-292. 

16 Paraphrased from Hull (1952a), pages 5-6; (1951), page 20. See also (1943), 
page 80. The decrease in S» as the basis of primary reinforcement is the more usual 
statement of the postulate, the choice having been made between decrease in drive ( D ) 
and in the drive stimulus ( Sd ) , The insertion of Sq into the postulate on primary rein- 
forcement appears to be an after-thought, based on the possibility that even when 
responses are reinforced by food it is not the drive that is satisfied but some derivative 
of the drive (i.e., some Sg), such as taste or appetite. Because r G and s G are central to 
secondary reinforcement, it would probably be wise, in considering the system as a 
whole, to forget that Hull included $a in Postulate 3. See the discussion of Corollary 
tv, page 148, 
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But for Guthrie it gets fixed merely because it occurs; for Hull it gets 
strengthened only because it occurs in association with reinforcement . 
Primary reinforcement is brought about, according to the postulate, by 
the decrease in the stimuli produced by a drive, for example, the removal 
of a noxious stimulus, the reduction of hunger pangs (or by a decrease 
of stimuli associated with the anticipated satisfaction of these drives ) . Hull 
notes that he has vacillated somewhat in his earlier discussions between 
a reduction in need or drive (D), and a reduction in drive stimuli (Sd). 
He ends up with the latter, but with some reservations, and, in fact, leaves 
the matter open for revision. 17 

Corollary i. Secondary drive 

When a neutral stimulus trace (s) has been closely associated with the 
evocation and rapid decrease of drive-produced stimuli (S D ), the hitherto neu- 
tral stimulus trace (s) acquires a tendency to bring about these drive stimuli 
(S D ), so that the previously neutral stimulus trace ( s ) becomes the occasion for 
a secondary drive ( s » S D ). 1S 

This corollary expresses an association between two stimuli: the 
neutral stimulus trace (s), and the drive stimulus (S D ) . Such an association 
is foreign to Hull’s system, for learning must always involve a response 
term. Actually a response intermediary is implied, even though not stated. 
This may be illustrated by the acquisition of fear as a secondary drive. 
The primary drive is pain, producing the original drive stimulus (S D ). 
Neutral stimuli associated with pain give rise to “fear” responses, very 
similar to responses to pain, and the proprioceptive consequences of these 
learned responses produce the drive stimulus (S D ) that serves the second- 
ary drive.* 9 The corollary, however, asserts nothing about this response 
intermediary. 

Corollary ii. Secondary reinforcement 

When a neutral stimulus trace ( s ) has been closely associated with a rapid 
diminution in drive produced stimuli (S D ), the hitherto neutral stimulus trace 
(s) acquires a tendency to bring about the reduction of S D , so that the previously 
neutral stimulus trace ( s ) acquires “the power of acting as a reinforcing agent.” 20 

Despite the seeming formality of his exposition, Hull used many 
variations in wording when stating his postulates, so that precise meanings 
are sometimes hard to infer. I have restated the two corollaries in closely 
parallel form, which he did not do. Then it becomes evident that a neutral 

Hull (1952a), page 153. 

is Paraphrased from Hull ( 1952a), page 6; ( 1951 ), page 25; ( 1943 ), page 98. 

is N. E. Miller ( 1948a ) ( 1951b ) . 

20 Paraphrased from Hull ( 1952a), page 6; ( 1951 ), page 28; ( 1943), page 95. 
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stimulus trace associated consistently with a reinforcing state of affairs 
acquires two functions ( and both at once! ) : ( 1 ) the power to arouse a 
secondary drive (Corollary i), and (2) the power to reduce drive stimuli, 
and hence act as a secondary reinforcing agent (Corollary ii). 

The distinction between secondary drive and secondary reinforce- 
ment is an important one, and practical examples can be found to make 
the distinction clear. For example, fear and anxiety easily classify as 
secondary drives, and their reduction is reinforcing. Similarly, distinctive 
food-boxes, o fr poker-chips used as tokens, readily classify as secondary 
reinforcing agents. A chimpanzee will work for a token only when hungry: 
the drive remains the usual hunger drive, but the secondary reinforcing 
agent is not itself hunger-satisfying. Hull intended to preserve this distinc- 
tion in the corollaries, but the formal statements do not specify when the 
neutral stimuli will evoke S D (then serving as secondary drives), or 
when they will reduce S D (then serving as secondary reinforcers). The 
only hint of a difference is that for the acquiring of a secondary drive 
the neutral stimulus must be present at the time of drive evocation as 
well as drive reduction, while, for the acquiring of secondary reinforce- 
ment, the neutral stimulus has to be present only at the time of drive re- 
duction. 

■ The attempt to interpret the actual events in secondary reinforcement 
came late in Hull's theorizing, and he probably had not developed a 
theory fully satisfactory to himself. That he was attempting to hold to a 
consistent drive-stimulus reduction theory of secondary reinforcement ( as 
well as of primary reinforcement), is clear from the following statement: 

It follows that any stimulus consistently associated with a reinforcement 
situation will through that association acquire the power of evoking the condi- 
tioned inhibition, i.e., a reduction in stimulus intensity, and so of itself producing 
the resulting reinforcement. Since this indirect power of reinforcement is ac- 
quired through learning, it is called secondary reinforcement. 21 

It can be pointed out that all conditioned stimuli must develop some 
secondary reinforcing powers because they must necessarily be associated 
with a reinforcing state of affairs. 

While as paraphrased the corollaries deal only with secondary drives 
and reinforcements, they are applicable also to higher-order derivatives, 
so that tertiary drives can be built upon secondary ones, and higher-order 
reinforcements can be ba^ed upon higher-order drives. 

21 Hull (1951), pages 27- 
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3. The Law of Habit Formation 

Having chosen contiguous association under reinforcement as the 
basic qualitative condition for habit strengthening, Hull had an additional 
basic choice to make regarding the quantitative influence of reinforcement. 
Either reinforcement could contribute its maximum influence on one trial 
(the possibility accepted as a genuine one by Skinner, and corresponding 
to Guthrie s one-trial learning ) , or reinforcement could add an increment 
to habit strength on each occurrence. Hull chose the latter alternative. 

Postulate 4. Habit strength (sH R ) as a function of reinforcement 

Habit strength (the tendency for a stimulus trace to evoke an associated 
response) increases as a positive growth function of the number of trials, pro- 
vided that trials are evenly spaced, reinforcement occurs on every trial, and 
everything else remains constant. 22 

The function described as a positive growth function is of the form: 
sH b = 1 - 10-** 

where N is the number of trials from absolute zero, and a is an empirical 
constant of the order of .03 as stated in the original postulate. Such a 
function is used throughout Hull’s quantitative formulations. It can be 
understood as a function with a maximum at 1.00, this maximum being 
approached in such a manner that each unit of reinforcement increases 
habit strength by a constant fraction of the possible habit strength as yet 
unrealized. This is, of course, a curve of decreasing gains. Because the 
total range is 1.00, the unit of habit was originally defined as a hab, a 
centigrade unit extending by steps of .01 to 1.00. 2a 

^ While the postulate shows habit strength to be a function solely of 
the number of reinforcements, we cannot tell from this postulate alone 
what other influences there may be upon habit strength. Because it is 
specified that everything else is constant, it might be that habit strength 
would be different with a given number of reinforcements if, say, drive 
were different. We may anticipate a little, however, and point out that 
there are no systematic influences upon S H R other than the number of re- 
inforced trials . It would actually have been unnecessary to state in the 
postulate that everything else is constant, for nothing else can affect sHr 
directly. 2 y 

22 Paraphrased from Hull ( 1952a), page 6; ( 1951 ), page 32; ( 1943), page 114. 

23 Hull (1951), page 32. The hab is dropped in Hull (1952a), and sHb becomes a 
multiplying factor without any units. 

24 One of the important changes between 1951 and 1952 is the dropping of the in- 
fluence upon sH» of stimulus intensity during learning. See footnote 28. 
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This is a radical departure from the earlier system, because now 
most of the influences, as we shall see, are upon reaction potential ( s Er) 
rather than upon habit strength ( S H R ). The distinction between per- 
formance and learning, so long insisted upon by Tolman, was formally 
accepted in the 1943 postulates, but by now has greatly reduced the 
quantitative influence of reinforcement upon associative learning. We 
would expect the magnitude of each reinforcement to be pertinent in 
determining the contribution of reinforcement to habit strength, but we 
are now told that this is not the case, so long as some unspecified minimum 
of reduction in S D occurs. 

N on-associative Factors Affecting Reaction Potential 

Reaction potential ( S E R ) is Hull’s term for the inferred process close 
to response evocation.^It can be inferred only from response, but is not 
to be identified with response. Reaction potential and response are not 
identical because reaction potential may be below threshold, hence not 
lead to response, or it may interact with competing tendencies and hence be 
incompletely revealed in response. 

We may conveniently think of S H R as signifying associative strength 
and of s E e as signifying the strength of the tendency to respond. At a given 
level of habit strength ( S H R ) the magnitude of the reaction potential ( S E R ) 
depends in addition upon three chief non-associative 25 factors operative 
at the time of response evocation. These are specified in the next three 
postulates. 

Postulate 5. Primary drive (D) 

A. A primary drive (at least that resulting from food deprivation) con- 
sists of two components: (1) the drive proper, which increases with the number 
of hours of food deprivation, and (2) an inanition component which reduces 
drive as starvation continues. 

B. Each drive condition generates a characteristic drive stimulus (S D ) 
which is an increasing function of the drive condition. 

C. Some drive conditions may motivate into action habits set up on the basis 
of different drive conditions. 26 

According to Spence, drive ( D ) is conceived by Hull as “a nonspecific 
state or general condition of the nervous system to which all the specific 
needs contribute. 1 ’ 27 Hence the general drive level is due not only to the 

25 Spence (1951a), 

26 Paraphrased from Hull (1952a), pages 6-7; (1951), pages 28-40; (1943) page 
253. The postulate as stated by Hull has four parts, one of which is introduced for 
illustrative purposes only. It describes the course of hunger drive as measured in one 
experiment (Yamaguchi, 1951), and is omitted here as unnecessary to the argument. 

27 Spence ( 1951a), page 249, 
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relevant drive but to all needs coexisting at the time. The drive stimulus 
(Sd) is more nearly specific to a given drive, so that it can provide for 
discrimination between drives as well as for reinforcement. There is some 
interchangeability among drives as acknowledged in the final part of the 
postulate. 

Drive activates habit strength into reaction potential. The simplest 
form of statement is: 


gE R — D x sHr 

This means that a habit of given strength will yield responses of 
greater or less magnitude, depending upon the level of drive operating 
at the time the response is evoked. For the sake of simplicity, this and 
the two following postulates assume that the levels of the non-associative 
factors have remained the same during learning and testing, in order to 
avoid the complications that ensue when a different level of drive is used 
in learning and in testing. This assumption makes it possible to provide 
quantitative expressions for the postulates with greater ease. The intent 
is, however, that drive strength has its true influence at the time of 
response evocation, that is, that it is a non-associative factor. 

The concept of drive is very important in Hull’s system. It has three 
distinct functions: 

' 1. Without drive there could be no primary reinforcement, because 
primary reinforcement requires the rapid diminution of S D . There could, 
of course, be no secondary reinforcement either, for secondary reinforce- 
ment originates in the association of a stimulus with primary reinforce- 
ment. 

2. Without drive there could be no response, for drive activates habit 
strength into reaction potential. Drive is the most important multiplier of 
sHr. 

3. Without the distinctiveness of the drive stimuli S D there could 
be no regulation of habits by the need state of the organism, no way to 
learn to go one place for water when thirsty, another place for food when 
hungry. 

The first and third of these functions together determine which in- 
centive is reinforcing, as well as why it is reinforcing; the second function 
is that of a goad to action in the direction of need satisfaction; the third 
also gives drive a discriminative or steering role. 

Postulate 6. Stimulus-intensity dynamism ( V ) 

The greater the intensity of the stimulus, the greater the reaction potential 
for any given level of habit strength. The magnitude of the stimulus-intensity 
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dynamism (V), as a component of reaction potential, is an increasing logarithmic 
function of the stimulus intensity. 28 

The basic equation parallels that for drive: 

sE R = V X sHr 

Postulate 7. Incentive motivation (K) 

The greater the magnitude of the incentive used in reinforcement, the 
greater the reaction potential for any given level of habit strength. The incentive 
motivation (K), as a component of reaction potential, is a negatively accel- 
erated increasing function of the weight of food or quantity of other incentive. 29 

Again it must be noted that the influence of amount of the reinforcing 
agent during learning is upon reaction potential and not upon habit 
strength: 

g E R = K X s H r 

We are dealing here with constant values of K within training and 
testing, so that the question of shift of incentive between learning and 
testing (as in latent learning experiments) must wait for further discus- 
sion. So, too, the problem of the influence of the perceived incentive, as 
part of response evocation, will be discussed later. 

Postulate 8. The constitution of reaction potential ( 8 E R ) 

When conditions have been constant throughout learning and response- 
evocation, the reaction potential ( S E R ) is determined by the habit strength 
( g H R ) multiplied by (1) the drive (D), (2) the stimulus-intensity dynamism 
(V), and (3) the incentive motivation (K). 30 

The general equation becomes: 

s E r = D X V X K X s H r 

This postulate merely combines the results of the preceding three 
into one equation. Several statements need to be made about this equation, 
especially about the implication of the multiplication signs and the re- 
sulting problem of units of measurement. 

The major magnitude in the determination of S E R turns out to be D 
rather than S H R . The maximum values of D, in the sigma units by which 
g E R is measured, may be as great as twice the maximum of S E R . 31 Hence D 

28 Paraphrased from Hull (1952a), page 7; (1951), page 46. The concept does not 
appear in Hull (1943). A change occurred between 1951 and 1952, in that in the 
1952 version Vi, the V during learning, no longer affects sHr. 

29 Paraphrased from Hull (1952a), page 7; ( 1951), page 51; ( 1943), page 129. 

89 Paraphrased from Hull (1952a), page 7; (1951), page 8, but with the omission 
of J and a changed role for V. The relations are considerably altered from Hull ( 1943 ), 
e.g., page 178. 

31 Hull ( 1951), page 58, states that the maximum hunger drive ranges “uncertainly 
from 6 to 12 <r” The maximum assumed for sEr is 6 
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is measured in the units of S E R . The other multiplying terms are merely 
decimal multipliers or weights, without units, each with a maximum of 1.00. 
Were sH R , V, and K all at the maximum, S E R would be equal to D. Any 
other values of S H R , V, and K make S E R less than D. A zero value for any 
multiplier (including D) makes S E R assume a value of zero. We shall re- 
turn later to a discussion of the measurement of S E R . For the present, we 
may note that the forms of the functions underlying S H R , D, V, and K 
have some degree of independent determination, but their units are en- 
tirely arbitrary. For S H R , V, and K the arbitrary maximum is 1.00, and 
they are pure numbers, without any kind of unit. Both the maximum value 
and the units of D are obtained by working backwards from S E R . Had D 
been treated symmetrically with S H R ( as it was when the unit called mote 
was assigned, paralleling hab) 32 then any one of the other terms could 
just as well have taken on the larger range of values. It is important to 
note how a tour de force of this kind makes possible the multiplication of 
four quite disparate quantities without causing the systematist major 
difficulties. A preferable solution (and an equally simple one) would 
have been to preserve the uniform (arbitrary) scale for each of the factors, 
and then to introduce an arbitrary constant to convert to the scale used for 
s E r . This would have made no difference in the end, and would have 
avoided the illusion that D was assigned more importance than the others 
on the basis of some kind of empirical evidence. It turns out to be merely 
the residual legatee of the constant required to convert to the units of 
sE r . 

Corollary Hi, Delay in reinforcement (/) 

A. For a response in a chain, the greater the delay in reinforcement, the 
weaker the reaction potential leading to that response. 

B. For a single response, the greater the delay in reinforcement, the weaker 
the reaction potential, according to a gradient that falls off rapidly at first and 
then more slowly, approaching the asymptote at about 5 seconds. 33 

The delay-in-reinforcement gradient has been very important in Hull’s 
theorizing ever since his early paper on the goal gradient. 34 The simple 
delay-in-reinforcement gradient (Part B of Corollary iii) became shorter 

32 Hull (1943), page 243. At that time the decimal was omitted, so that both sHr 
and D had maximum values of 100. 

33 Paraphrased from Hull (1952a), pages 7-8. The equation stated by Hull is based 
on a complicated determination, (1952a), pages 126-133. Hull himself had serious 
doubts about some of the assumptions involved, (1952a), page 131 (footnote), page 
133. The status as a corollary is a change from ( 1951), page 56, where J was a multi- 
plier equivalent to D, V and K. It was then a modification from ( 1943), pages 145, 178, 

34 Hull (1932). 
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as Hull’s system evolved. The earlier gradient for a single reinforced 
response extended for 30 to 60 seconds, while the final one extends but 
5 seconds. 35 

The shortening of the gradient came about as secondary reinforce- 
ment gained more prominence in the generation of the longer gradients. 
By placing the gradient in a corollary rather than a postulate, Hull ac- 
cepted Spence’s suggestion that the primary gradient is probably absent 
altogether, all gradients being generated through secondary reinforcement 
or other intermediate mechanisms. 36 

Corollary iiiA has the logical status of a theorem, extending the 
gradient to cover links in a chain of responses. 

z' 

Behavioral Summation and Withdrawal 

The next four corollaries deal with a complication of habit strength 
and reaction potential when two stimuli (along a continuum) have been 
associated with the same response through reinforcement. 

Corollary iv. The summation of habit strengths when two stimuli S and S' have 
been separately associated with a response through reinforcement 
Corollary v . The summation of reaction potentials when two stimuli S and S' 
have been separately associated with a response through reinforcement 
Corollary vi. The withdrawal of habit strength when S' is withdrawn from the 
combination of S and S' as in Corollary iv 
Corollary viL The withdrawal of reaction potential when S' is withdrawn from 
the combination of S and S' as in Corollary v 37 

These four corollaries all revolve about the same basic assumption. 
The combined habit strength is said to be composed of (1) the primary 
habit strength (sH R ) to the stimulus more frequently followed by a rein- 
forced response, and (2) the additional habit strength to this stimulus 
( sAr) generalized from the habit strength acquired by the second stimulus 
(s'Hr). The sum is less than that obtained by direct addition of these 
two components. To compute the sum you may first convert both of these 
habit strengths (sH R and sfi R ) to the number of reinforcements that 
must follow response to S in order to reach these strengths. Add these 
two N’s and from this sum compute the equivalent habit strength. This 
will be the combined habit strength you seek. An algebraic simplification 

35 For the earlier gradient, see Hull ( 1943 ), page 145, for the later one, Hull ( 1952a ) , 
page 131. 

36 Spence (1947). 

37 Corollaries iv through vii appear in Hull (1952a), pages 8-9, and in (1951) 
pages 63, 65, 67, and 69. The earlier discussion appeared in (1943), page 223. 
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reduces the computational complexity and permits computation directly 
from the habit strengths: 

Hr sAr = sH r + sS R — s^r X s^R (Corollary iv) 

The same argument applies to the computation of the combined 
reaction potential, and the steps are reversed in the withdrawal of habit 
strength and reaction potential. 

Corollary viii. Behavioral summation of incentive substances 38 

The corollary as Hull states it indicates a method of determining 
the relative weights to be assigned different incentive substances (e.g., 
water and food) in order to compute a combined value of K, the incentive 
reinforcement, when both substances are used together. ^ 

Inhibition 

The fundamental form of inhibition in most conditioning theories 
is represented by experimental extinction — the reduction in response with 
repeated non-reinforcement. One interpretation is that extinction is a result 
of competing response tendencies. This interpretation requires no new 
principle, and no negative response tendencies have to be computed. It 
is the position we have met in the views of both Guthrie and Skinner. The 
second interpretation calls for a special theory of inhibition, producing a 
negative reaction potential. Hull adopts this second kind of theory. The 
empirical and theoretical background for Hull's theory rests on Hovlands 
earlier recognition of inhibition-of-reinforcement, 39 on Neal Miller s analy- 
sis of the role of muscle strain, fatigue, and pain in extinction and spon- 
taneous recovery, and in "anticipatory relaxation," 40 and on Mowrers 
experimental test of effortfulness of task in relation to experimental ex- 
tinction. 41 

; With this background, Hull suggests that two kinds of inhibitory 
potential compete with excitatory potential to weaken it: reactive inhibi- 
tion (Ir) and conditioned inhibition (sIr). These both act as negative 
reaction potentials. They summate to produce a total inhibitory potential 
that can neutralize positive reaction potential.; 

Reactive inhibition (Ir) is generated whenever a response occurs, and 

38 The corollary is stated in mathematical form in Hull (1952a), page 9; (1951), 
page 72. 

39 Hovland (1936). Sometimes continued reinforcement leads to a decline in 
response rC'Cnnb 1: "2 CYpcdmcn+V extinction, e.g., Hilgard ( 1933). 

* ■ M'iici ;.*■(! Do'hi-d 1911). pages 40-43; Miller (1951b), pages 451-452. 

41 Mowrer and Jones (1943). 
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is akin to tissue injury, fatigue, or pain. As such, it acts as a barrier to 
repetition, directly inhibiting reaction potential. It has a second aspect in 
addition, in that it acts (just as pain does) as a drive, strengthening any 
activity associated with its reduction. 42 The activity that reduces Ir is rest 
("non-activity"), and it is this conditioned non-activity that is known as 
conditioned inhibition (sIr). 

Postulate 9. Inhibitory potential 

A. The occurrence of a response produces reactive inhibition (Ir) which 
both inhibits reaction potential and acts as a negative drive. 

B. Reactive inhibition (I R ) dissipates spontaneously with the passage of 
time as a simple decay function of elapsed time. 

C. As a given response is repeated, increments of reactive inhibition sum- 
mate. The resulting I R also summates with conditioned inhibition (sIr) to pro- 
duce the aggregate inhibitory potential (I R ). 

D. When non-reinforced responses follow each other at short intervals, the 
aggregate inhibitory potential (I R ) increases as a positive growth function of 
the number of non-reinforced trials, thus yielding the phenomena of experi- 
mental extinction. 

E. Because the aggregate inhibitory potential (I R ) increases with the 
magnitude of the work involved in each response, the greater the work involved 
the fewer the number of unreinforced responses to a criterion of experimental 
extinction. 43 

Portions of this postulate ( especially parts C, D, and E) are intimately 
related to the three following corollaries. 

Corollary ix. Conditioned inhibition 

Stimulus traces closely associated with the cessation of a response, in the 
presence of I R (which is assumed to decrease also when the response ceases) 
become conditioned to the particular non-activity. This conditioned non-activity 
is known as conditioned inhibition ( S I R ), and opposes reaction potential (gE R ). 
The amount of s Ir generated increases with the amount of I R present. 44 

The systematic considerations here are actually very complex, and 
they are treated rather sketchily by Hull. Because sIr is a habit, it ought 
to have two components, the equivalents of sH R and s Er. Actually, Hull 
treats only the reaction potential (sE R ) equivalent.^onditioned inhibi- 
tion (sIr) is said to be a function of the amount of Ir because the more 
Ir there is the more rapidly it can diminish and hence the more reinforcing 
it can be. 45 This argument reflects outmoded features of the system. A form 

42 Hull did not follow up all the implications of treating In as a drive. He did not 
subtract it from D, which as a matter of course his system requires; he did not use it 
in conjunction with D as a multiplier in the determination of reaction potential. 

48 The five parts of Postulate 9 are paraphrased from Hull (1952a), pages 9-10; 
( 1951), pages 74-81. Antecedents occur in ( 1943), especially page 300. 

44 Paraphrased from Hull (1952a), page 10; (1951), page 75; (1943), page 300. 

45 Hull (1951), page 75; Kimble (1949), page 22. 
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of the theory more consistent with the current system would simply make 
a stronger drive (Ir) a larger multiplier, with the same result — a larger 
S I E resulting from larger values of I R . 

In Corollary ix, conditioned inhibition is derived from the principle 
of reinforcement (Postulate 3) by interpreting inactivity as a response and 
Ik as a drive. Hence it is legitimately a corollary rather than a postulate, 
although Hull is not always clear about his basis for classifying the several 
principles. The corollary is necessary to provide a definition of conditioned 
inhibition ( S I R ) already made use of in Postulate 9C. 

Corollary x . Aggregate inhibitory potential as a function of work 

The aggregate inhibitory potential that is generated through complete ex- 
tinction requiring a fixed number of trials is an increasing function of the 
magnitude of the work involved in each response. 46 

Corollary xL Aggregate inhibitory potential as a function of the number of re- 
sponses 

The aggregate inhibitory potential that is generated through complete ex- 
tinction requiring a fixed magnitude of work per response is an increasing func- 
tion of the number of trials. 47 

Corollary x follows from Postulate 9E, but restates the postulate for 
a constant number of trials to complete extinction, instead of a constant 
reaction potential. A good guess as to the reason for this and the following 
corollary is that scaled data were available from experiments based on 
responses to extinction. 48 

Corollary xi bears a corresponding relation to Postulate 9D. The 
postulate states the number of trials to extinction when reaction potential 
is constant and the work per response varies, while the corollary states 
the number of trials to extinction when work is constant and reaction 
potential varies. This implication of the corollary is not self-evident, until 
we remember that the total amount of inhibition generated during ex- 
tinction must equal the total amount of reaction potential that it must 
neutralize. Hence with constant work, the only independent variable im- 
plied in Corollary xi is the amount of reaction potential to be overcome 
through the generation of inhibition during extinction. 

Equivalence and Interaction of Stimuli 

We now come to two postulates concerned with stimulus interrela- 
tionships, the first with generalization , when one stimulus is to some degree 

46 Paraphrased from Hull (1952a), page 10; (1951), page 84; (1943), page 279. 

47 Paraphrased from Hull ( 1952a), page 10; ( 1951 ), page 85. 

48 The experiments are those of Mowrer and Jones (1943), Perm (1942), and 
Williams (1938). 
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equivalent to another, the second with afferent stimulus interaction when 
two or more stimuli are active at once. The earlier corollaries that dealt 
with interaction effects (Corollaries iv through viii) considered the sum- 
mation and withdrawal of reaction potential, once habits have been set 
up. Here the concern is with the properties of stimuli and with the changes 
in stimulus traces when more than one stimulus is active at once. 

Postulate 10. Stimulus generalization 

A. The generalized habit strength (s 2 H R ) associated with a stimulus trace 
s 2 differing in quality from the stimulus trace involved in the habit ^Ha) 
depends upon the remoteness of s 2 from s x on a qualitative continuum in units of 
discrimination thresholds (j.n.d.’s). 

Provided D, K, and V remain constant, the generalized reaction potential 
(<? 2 E r ) will vaiy directly with s 2 H B . 

B. The generalized habit strength (s 2 H R ) associated with a stimulus-trace 
($ 2 ) differing in intensity from the one (s 1 ) involved in the prior learned habit 
(SjHa) likewise depends upon the remoteness of s 2 from s v Because the differ- 
ence is one of intensity, log units may be used instead of j.n.d.’s in determining 
the stimulus separation. The amount of generalization, furthermore, is a function 
of V.^ the stimulus-intensity dynamism of s 1 during conditioning. 49 

Provided D and K remain constant, the generalized reaction potential 
($ 2 E u ) will vary directly with s 2 H R and V 2 , the stimulus-intensity dynamism of 
s 2 during testing. 

C. Generalized conditioned inhibition (s 2 I R ) follows the same principles 
of generalization from along qualitative and quantitative stimulus continua 
as generalized habit strength and generalized reaction potential. 60 

The postulate means essentially that the more alike two stimuli are, 
the more nearly one can substitute for the other in arousing conditioned 
responses. The differences between stimuli along a dimension permit 
description of a gradient of generalization along this dimension. Such a 
gradient is implied in each of the three parts of the postulate. A number 
of closely related attempts have been made to derive the form of the 
gradient from empirical data, and to apply the results to problems of 
stimulus equivalence and to discrimination. 51 

Corollary xii. The generalization of habit strength and reaction potential with 

change in strength of drive 

When a response is learned under one_ intensity of drive and tested under 
another, both generalized habit strength ( S H R ) and generalized reaction poten- 

40 This use of Vi appears to be a “hangover” from the 1951 postulate set. This is the 
only place in the final postulates where habit strength is any longer a function of 
stimulus intensity. See footnote 28. 

50 The postulate in its three parts is paraphrased from Hull (1952a), pages 10-11; 
(1951), pages 88-92 (with minor systematic differences); (1943), pages 199, 264. 

siE.g., Spence (1936) (1937) (1939) (1942); Hull (1939a) (1947b) (1949) 
(1950b). For related papers of his own, and a critical review of the Hull-Spence 
formulations, see Razran (1949). 
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Hal (,sEk) will be decreased, their strengths following a gradient based upon the 
amount of difference in drive strength during learning and during testing . 52 

The meaning of this corollary is not clear without some explanation. 
Drive-stimulus habit strength ( s ^Hu) is always a component of the total 
habit strength resulting from learning. (We have not met this in exactly 
this form before, but Sd is always present with the response that is being 
reinforced, and so enters into association with it.) Hence under a new 
drive intensity, the total generalized habit strength will have as a com- 
ponent a reduced generalized drive-stimulus habit strength ( s H R ). The 
reduction follows from Postulate 10, the new drive stimulus being at some 
distance from the original one. It follows that as a consequence of the 
change in the drive stimulus, the resulting total generalized habit strength 
or reaction potential will be less under a new drive intensity than under 
the original drive conditions, the amount less depending on how far apart 
the drive conditions of learning and testing happen to be. 

It comes as something of a surprise to learn that D, as a multiplier 
of habit strength, holds during learning and testing when D remains con- 
stant , but at the time of testing a change in D always reduces reaction 
potential, even though D is increased. This is all the more surprising be- 
cause the other multipliers (V and K) continue to operate as multipliers 
with changed intensity, increasing reaction potential as they increase, 
and decreasing reaction potential as they decrease. 

The empirical data which give general support to the corollary are 
those of Yamaguchi ( 1952 ) . He found that with starvation periods ranging 
from 3 to 72 hours, with tests at various in-between periods of food 
deprivation, tested responses tended to be at minimum latency when the 
testing and training drives were alike. When tested at other intervals of 
deprivation, either greater or less than the training interval, responses 
tended to be of longer latency. These results are in agreement with the 
corollary, or, to put it the other way, these results gave rise to the corol- 
lary. 

A related experiment by Strassburger (1950), perhaps unfamiliar to 
Hull, yielded results conforming better to what one would expect from 
Hull’s general postulate system, but disagreeing with this corollary. After 
training under various conditions of food deprivation from % to 47 hours, 
then testing under a uniform deprivation of 23 hours, he found no dif- 
ferences in the number of responses to extinction that could be attributed 
to differences in the drive level at the time of original learning, but he did 

52 Paraphrased from Hull (1952a), page 11; (1951), page 90; (1943), pages 235- 
236. 
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find differences attributable to the number of prior reinforcements. This 
is just what Hull’s Postulate 4 would lead us to expect. It is not consistent 
with Corollary xii, however, because change of drive at time of testing 
did not of itself reduce responsiveness. 

Postulate 11. Afferent stimulus interaction in external inhibition 

When a response has been conditioned to a stimulus (SJ and one or more 
previously neutral stimuli (S 2 , S 3 , . . .) are presented along with S 1? the afferent 
impulses from this combination of stimuli interact to yield a new molar impulse 
(s ) . This new impulse is equivalent to a stimulus at a greater or less distance from 
S, on a qualitative continuum. The resulting generalized reaction potential to s 
will be smaller than that to s v depending upon their remoteness from each 
other. 53 

The equation defining the amount of reduction in JE R is: 

lo g~F~ 

d (in j.n.d/s) = pL 

where S E U is the original reaction potential, *E n the reaction potential reduced 
by external inhibition because of the presence of the added stimuli, and / a con- 
stant determined by the form of the gradient of generalization. 

It is clear that the equation attempts to quantify the effects, while 
asserting nothing at all about the kind of patterning or other relationship 
among stimuli responsible for these effects. Furthermore, as presented it 
quantifies merely the limited form of interaction found in external in- 
hibition, and does not cover the many other varieties of stimulus interac- 
tion. Parallel equations could have been developed covering some of the 
other matters that Hull discussed less formally, and he intended that a 
research program would determine the circumstances producing the kinds 
of changes he was ready to quantify. 

$ /This postulate permitted Hull to acknowledge the properties of pat- 
terned stimuli — the essence of the gestalt perception problem. He be- 
lieved that in this postulate he had achieved a rapprochement .with gestalt 
psychology, a sign of the maturing of psychological science. 5 ^ 

Random Variability of Reaction Potential J 

[organisms vary from moment to moment in their ability to perform 
well-established habits. In memorizing nonsense syllables, a syllable cor- 
rectly anticipated on one trial may be missed on the next trial. A stimulus 
threshold is never at a fixed value. The threshold is instead some measure 
of central tendency, with a dispersion of values about it. This body of 

53 Paraphrased from Hull (1952a), page 11; (1951), page 95. An earlier, somewhat 
more general statement, is in Hull (1943), page 47. 
a* Hull (1942), page 77. 
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facts pointing to momentary variation is encompassed in the following 
postulate. 

Postulate 12. Behavioral oscillation ( a O R ) 

A. Reaction potential ( s E a ) varies from moment to moment, the standard 
deviation of the fluctuations serving as a measure of behavioral oscillation 
( s O R ). The distribution in leptokurtic. 

B. Behavioral oscillation ( s O R ) begins with zero dispersion at absolute zero 
of s H r , then rises to an unsteady maximum as S H R increases. 

C. The oscillations of competing reaction potentials are assumed to be 
asynchronous . 55 

The concept of behavioral oscillation, as finally embodied in this 
postulate, intrigued Hull as early as 1917, 56 and it has had a place in each 
of his formal systems. It is instructive to note the changes that have oc- 
curred, because these changes tell us something about the provisional 
nature of system-making at this stage in the history of learning theories, 

1. In 1940 it was the response threshold that oscillated. Hence a 
given reaction potential might suffice to bring a response above threshold 
on one trial, and then (without change in the reaction potential) the 
response might fall below the oscillating threshold on the next. 57 

2. In 1943 behavioral oscillation became an inhibitory appendage to 
reaction potential. Thus the amount of sOr present at the moment of 
responding was always subtracted from the sEr present at that moment. 
The distribution of gO R was assumed to be normal, that is, Gaussian. 58 

3/ By 1952, the oscillation was incorporated into reaction potential, 
and no systematic reason existed any longer for the symbol s O R , it having 
become simply the standard deviation of sE R , and, incidentally, the unit 
according to which reaction potential was quantified. The detailed efforts 
to quantify reaction potential led to several of the refinements reflected 
in die postulate, notably (1) the statement that the distribution of s Er 
is leptokurtic rather than Gaussian, and (2) the discovery that the disper- 
sion of s O R is not constant, but changes with the number of reinforcements. 

\ The deductive use of the concept of behavioral oscillation was also 
extended in 1952 to account for alternation cycles in trial-and-error learn- 
ing. \We shall return to this deduction later in this chapter. 


55 The whole postulate is paraphrased from Hull ( 1952a), pages 11-12. A typo- 

graphical error in his book makes part B of the postulate refer to subthreshold rein- 
forcements where superthreshold reinforcements are intended. See also Hull (1951), 
pages 97, 99; ( 1943), page 319. „ , x 

56 Hull ( 1917). For a note on the history of the use of the concept, see Hull ( 1952a), 


pages 56-57. 

67 Hull and others ( 1940), page 74. 
58 Hull ( 1943), pages 314 and 319. 
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Corollary x in. Response generalization 

A. Response-intensity generalization. When a muscular contraction of 
given intensity has been reinforced, that muscle will contract on successive trials 
with varying intensities in a distribution about the central reinforced region. 

B. Qualitative-response generalization. When the contractions of several 
muscles produce a given habitual act, the combination of the independent varia- 
tion of each produces a qualitative change from the central result originally rein- 
forced. 59 

We have met qualitative change in response in Skinner s treatment 
of response differentiation and in Iris use of spontaneous changes to move 
by successive approximations from one response to another. Hull had 
something of this sort in mind in this corollary. 

Hull mentioned the corollary specifically three times in the 1952 book. 
He referred to it first by showing how, through reinforcing only the re- 
sponses of appropriate intensity, you can shift response intensity along 
a continuum. He pointed out, second, that locomotion is a highly general- 
ized form of response. Walking to one point in space does not differ 
fundamentally from walking to any other point. Third, he referred to the 
principle in accounting for the shift from moving toward a goal to dragging 
the goal-object toward the subject, as in stick and hoe experiments. 60 

^ Reaction Potential and Evoked Response 

i The postulates have moved gradually from stimulus-traces through 
habit strength, drive, and other intervening variables determining reaction 
potential, until now we are ready to consider the emergence of the ob- 
servable, recordable response. Reaction evocation requires, first of all, 
that reaction potential must be above the threshold of response. 

( postulate 13. Absolute zero of reaction potential (Z) and the reaction threshold ^ 

(sLr) 

A. The reaction threshold ( S L R ) lies above the absolute zero (Z) of reac- 
tion potential ( s E r ) . 

B. Response evocation occurs only when the momentary reaction potential 

exceeds the reaction threshold. 61 

r 

The “absolute zero” referred to here is a “relative” absolute, for it is 
based empirically on the number of reinforcements which must be de- 
livered experimentally before there is indication of learning. In the illustra- 
tions used by Hull, the order of magnitude of reinforcements between 

59 Paraphrased from Hull (1952a), page 12; (1943), page 319. The corollary does 
not appear in Hull ( 1951 ) , 

60 These three mentions can be found in Hull ( 1952a), pages 200, 218, and 324. 

61 Paraphrased from Hull (1952a), page 12; (1951), page 102; (1943), page 344. 
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absolute zero and the reaction threshold is two reinforcements of bar- 
pressing. This says nothing of the transfer from prior learning of other 
manipulatory skills or eating habits, and so can scarcely be thought of as 
a true absolute zero of habit strength. In fact, careful experiments have 
shown that the bar-pressing tendency has appreciable strength prior to 
reinforcement. 62 

Corollary xiv. The competition of incompatible reaction potentials 

When reaction potentials to two or more incompatible responses occur in 
an organism at the same instant, each in a suprathreshold magnitude, only that 
reaction whose momentary reaction potential is greatest will be evoked. 63 

The corollary is convenient in the derivation of trial-and-error learning, 
where the choice is, say, between turning right or left. The organism will 
then make the response that has the higher momentary reaction potential. 
This corollary now accounts for probability of occurrence as a measure 
of response strength for competing responses. As a broad generalization the 
corollary is by no means self-evident. Antagonistic muscles may both be 
contracted at once. Then, while either flexion or extension may occur, the 
muscles will be under heightened tension, and the response will differ from 
what it would have been had only one muscle set been active. The pos- 
sibility of compromise response — much emphasized by Guthrie — must 
also not be overlooked. 

/ Postulate 14. Reaction potential as a function of reaction latency 

Reaction potential ( g E R ) can be inferred from reaction latency ( s t R ); the 
shorter the latency the greater the reaction potential. 64 

The logic of the system, as a causal chain, is that latency is a function 
of reaction potential, and the postulate was so stated in early versions of 
the system.\ However, Hulls desire to emphasize the anchorage of the 
system to observables led him in his later versions to turn the matter around, 
and to indicate how the strength of the reaction potential could be inferred 
from latency. The form of the function he expressed as follows: 

g E R = a (s*a)” b 

The effort to quantify S E R in some absolute manner, independent of 
the special units of measurement used in experiments, occupied a great 
deal of Hulls attention during his last years. One of his close associates 
has stated that he believes it “almost certain” that Hull considered the 

62 Schoenfeld, Antonitis, and Bersh ( 1950b). 

63 Paraphrased from Hull (1952a), pages 12-13; (1951), page 104; (1943), page 
344. 

64 Paraphrased from Hull (1952a), page 13; (1951), page 107; (1943), page 344. 
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articles on the quantification of habit strength (and reaction potential) 
to be the most significant of his theoretical papers. 65 

The method used in quantification consisted of a paired-comparison 
technique based on one of Thurstone’s scaling procedures. The basic 
assumption is that latency directly reflects excitatory potential, and the 
comparisons used are between latencies on successive trials for groups of 
animals trained under comparable conditions. Beyond this basic assump- 
tion of decreasing latency with increasing reaction potential, the remain- 
ing assumptions are essentially statistical ones, having to do with the 
deriving of an equal-unit scale. The unit, as previously mentioned, is that 
of cr , the standard deviation of s Er. Because this unit is very general (not 
in grams, millimeters, seconds, or any other measure specific to experi- 
ments), it serves Hull's purposes very well. Having devised this scale, 
Hull was able to co-ordinate other aspects of response to it, and the final 
postulates were based on the assumption that other measures could best be 
stated in terms of the unit derived from the study of latency. 

Postulate 15. Reaction potential as a function of reaction amplitude j 

Reaction potential may be inferred from reaction amplitude. In the case of 
the Tarchanoff galvanic skin reaction in man, the relationship between reaction 
potential and amplitude is a linear one. 06 

The reinforcements in a galvanic skin response experiment with human 
subjects were converted to the new units of sEr derived from the latency 
of bar-passing responses in rats. These scaled values of, sEr turned out 
to bear a linear relationship to the amplitude of the conditioned galvanic 
skin responses as recorded on an uncalibrated apparatus in millimeters of 
deflection. This is the empirical basis for the postulate. 67 

Hull felt that he was on safe ground in confining the amplitude 
postulate to autonomic conditioning (as represented specifically by the 
Tarchanoff phenomenon). Amplitude in striate muscle conditioning is 
complicated by the possibility of limiting amplitude through conditions 
of reinforcement (Corollary xiii), a complication Hull felt to be inap- 
plicable to autonomic responses, 68 

65 Hovland ( 1952), page 349. The major papers are: Hull, Felsinger, Gladstone, and 
Yamaguehi (1947), Felsinger, Gladstone, Yamaguchi, and Hull (1947), Gladstone, 
Yamaguchi, Hull, and Felsinger (1947), Yamaguchi, Hull, Felsinger, and Gladstone 
(1948). 

66 Paraphrased from Hull (1952a), page 13; (1951), page 109; (1943), page 344. 

87 It may be noted, however, that the same argument from the same data was 

possible in 1943, before the more elaborate scaling method was devised. Compare 
Hull (1943), Figure 78, page 340, with Hull (1951), Figure 25, page 109. Original 
data from Hovland ( 1937b ). 

68 Hull (1943), page 339. He knew, of course, that the same complication was 
possible with latency, but he disregarded it on the assumption that the necessary ex- 
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Tostulate 16. Total responses to extinction as a function of reaction potential 

A. The reaction potential ( S E R ) acquired following massed reinforcements 
can be inferred from the number of massed unreinforced evocations (n) re- 
quired to produce experimental extinction. 

B. The reaction potential ( S E B ) acquired following quasi-distributed rein- 
forcements can be inferred from the number of massed unreinforced evocations 
(n) required to produce experimental extinction, but for any given value of n 
the inferred S E R will be higher than under the conditions of Part A, and the 
forms of relationship differ for Parts A and B. c0 

, Hull gave equations for both of these functions, but because he had 
a tendency, especially in the final system, to select any convenient illustra- 
tion for purposes of quantitative statement, it is not at all clear how general 
he expected these relationships to be. That of Part A might conceivably 
have some measure of generality; that for Part B is surely merely illustra- 
tive, for "quasi-distributed reinforcements” during training followed by 
"massed trials” during extinction yield a whole family of functions. The 
particular function presented as an equation in the statement of the 
postulate 16A expressed reaction potential as a function of trials to ex- 
tinction. If inverted to state the causal order of events, the equation implies 
after massed reinforcements a curve of increasing gains for number of 
responses to extinction with increments in reaction potential. For example, 
a reaction potential of 2.0 cr will be followed by 20 non-reinforced responses 
in massed extinction, while one of 3.0 or will be followed by nearly 50 non- 
reinforced responses. After distributed reinforcements (Postulate 16B), 
the corresponding curve is one of decreasing gains with increases in reac- 
tion potential. While for S E R of 4.0 o- there will follow 20 non-reinforced 
responses to extinction, even at the near-maximum of 6 o- there would be 
less than 25 non-reinforced responses to extinction. It is doubtful if the 
contrast between these two functions does more than point to the com- 
plexity of the relationships involved with varying schedules of reinforce- 
ment — a point amply demonstrated by Skinner. 

"10 Individual Differences ^ 

Hull offered the suggestion that the "constants” appearing in his 
equations may vary with the learner, and hence serve as a kind of index to 
individual differences) If, for example, rate of human learning were found 
in some context to be a function of measured intelligence, then the in- 

perimental precautions could be taken. For criticism of latency as a measure see Mueller 
(1950), Skinner (1950). 

69 Paraphrased from Hull (1952a), page 13; (1951), pages 112-114. The relation- 
ship between *Eb and n had earlier been assumed to be linear ( 1943), page 344, while 
in 1951-1952 the functions are both curvilinear. 
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dividual IQ (or some equivalent of it) might appear as a parameter in 
the equation relating the number of reinforcements to the achieved reac- 
tion potential. 

Postulate 17. Individual differences 

The “constant” numerical values appearing in the equations stated in the 
basic postulates and corollaries vary from species to species, from individual to 
individual, and from some physiological states to others in the same individual 
at different times. 70 

The treatment of individual differences was purely programmatic, for 
the postulates and corollaries as stated were based on responses from 
groups of subjects. Furthermore, the constants were usually obtained from 
curve-fitting, and hence do not have the rational status desirable in inter- 
changeable constants subject to independent measurement. 

The problem sensed by Hull is an important one, and could be faced 
provisionally with groups of subjects tested under several conditions. Con- 
sider, for example, the constants in the curve for serial-position effects, 
worked out in the rote learning system. The three constants are L (the 
reaction threshold), AK (the amount of inhibition generated by each 
repetition of each nonsense syllable ) , and F ( a factor of reduction whereby 
inhibition is progressively reduced at syllables more remote from the 
point of origin of the inhibition). If the same learners participate through- 
out, and everything else is kept constant through appropriate experimental 
controls, then the constants should change in some systematic way for 
groups of learners of different ages. For any one group of learners the 
constants should not change when new conditions are introduced, such 
as length of list, which have no bearing on the constants. We have no evi- 
dence of this kind at the present time to support the adequacy of constants 
as measures of individual differences. The logical possibilities are clear 
enough, and it is these possibilities to which Hull called attention in the 
postulate. 

^ The Antedating Goal Reaction and Its Role in Secondary 

Reinforcement 

Hull ended his final set of postulates and basic corollaries with the 
following one. 

Corollary xv. Secondary reinforcement by fractional antedating goal reactions . 

When a response (R) is associated with a stimulus trace (s) and this 
stimulus-response conjunction is accompanied by an antedating goal reaction 

70 Paraphrased from Hull ( 1952a), page 13; ( 1951), page 117. The matter was first 
dealt with systematically in Hull ( 1945). 
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(r G ) and by the stimulus resulting from it ( s G ), there will result an increase in 
the tendency for that stimulus ( $ ) to evoke that response (R). 

The reinforcement comes about because of the secondary reinforcing 
powers of $ G . 71 

In this corollary and in the discussion of it, Hull says nothing more 
about the role of $ G in primary reinforcement (Postulate 3). This omission 
supports the earlier interpretation that the addition of s G to Postulate 3 
was an afterthought. The secondary reinforcing power of Sa in Corollary 
xv is that of any stimulus that has been closely associated with primary 
reinforcement (see Corollary ii). 

We have now completed a summary of the postulates and corollaries 
presented in the first chapter of Hull’s last book as the basic “laws” of 
his system. In later chapters he added only two more principles given the 
status of corollaries, although he stated 133 theorems. The corollaries 
must have had special meaning for him, as somehow fundamental, and 
they therefore merit discussion here. 

Corollary xvi. Generalization between reaction ‘potentials to positive and nega- 
tive stimuli in simple trial-and-error learning 

A. When an organism acquires two reaction potentials S E R+ and S E R _ to 
similar stimuli S + and S_ in the evoking of responses R + and R_, which, while 
different, use substantially the same muscles, a gain in S E R+ through reinforce- 
ment will be accompanied by a generalized gain in S E R _. 

B. When the reaction potential to the negative stimulus (sE R __) loses 
strength through extinction, a generalized loss in S E R+ will accompany this 
loss in S E R __. 72 

The relationships in this corollary follow rather directly from stimulus 
and response generalization (Postulate 10 and Corollary xiii). 

Corollary xvii. Experimental extinction 

In non-reinforced trials following regular reinforcement, the evoking stimu- 
lus (1) for a time will continue to evoke the response, (2) sometimes will evoke 
the response for a time with a rise in reaction potential, (3) the rise (if any) will 
be followed by ~ fall in reaction potential, and (4) the course of the 

fall in reaction p- * he reverse of a positive learning curve. 73 

Why a new corollary, when the same facts were covered in Postulates 
9A and 9D? Except for the statement about an occasional rise in reaction 
potential with non-reinforcement, there is nothing new in the corollary 
itself. The novelty rests instead in some of the thinking that lies behind 
the corollary. Hull, renewing an earlier suggestion, 74 described the realiza- 

71 Paraphrased from Hull (1952a), pages 14 and 125. 

72 Paraphrased from Hull ( 1952a), page 23. 

73 Paraphrased from Hull (1952a), page 134. 

7 4 Hull (1937). 
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tion of an anticipation as occurring when actual eating (So >Rg) fol- 
lows upon the antedating sequence (r G >s G )- When the anticipated S G 

is omitted, as in non-reinforcement, the anticipation is frustrated. Frustrated 
anticipation may lead to more intensified activity ( heightened S E B ) through 
aroused emotion (hence heightened D). This accounts for that portion 
of the corollary suggesting temporary increase in reaction potential early 
in extinction. In the end, the accumulation of I R will lead to diminished 
s E r . The decay process is enhanced, however, by the conditioning of in- 
hibition to s G , yielding $ g Ir. Because this is learning, we have reason to 
expect the form of the extinction curve to reflect ordinary learning, though 
in reverse. 

In this corollary we have a good indication of unfinished business, 
as Hull, clear to the end, was attempting to rethink his systematic 
formulations. 

The System Summarized 

The set of postulates and corollaries became somewhat formidable, 
and in places somewhat fragmented as Hull attempted to work into the 
basic principles not only all manner of quantitative relationships found to 
hold in classical and instrumental conditioning, but other kinds of phenom- 
ena needed to deal with types of problems beyond these reference experi- 
ments. Even so, it is possible to cut through some of the specific detail, 
and to summarize the system in rather direct fashion as a chain of anchored 
constructs beginning with the antecedent conditions (input), moving 
through the intervening variables to response (output). Such a summary 
is given in Figure 15. 

In Column I we have input conditions, all except S 'H R defined by 
objective experimental conditions. ( s 'Hr is habit strength from a related 
habit, to become expressed as generalized habit strength, s Hr. ) 

In Column 2 we find the intervening variables most closely tied to 
the antecedent conditions. In Column 3 we assemble, in an intermediate 
step, the consequences of the simultaneous presence of the variables in 
Column 2. We are now (in Column 4) close to response evocation, but 
we must first take into account the oscillation of reaction potential ( s Or) 
and the threshold of response ( S L R ). Finally (in Column 5) the response 
emerges, with the measurable characteristics of latency ( s f R ), amplitude 
(A), or number of non-reinforced evocations to extinction (n). Formerly, 
probability of response was also used as a response measure, but in the 
final system the probability measure was felt by Hull to be appropriate 
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FIGURE 15. Summary of Hull's final system. The input and output variables, and 
the intervening variables, are symbolized as follows in the figure: 


Column ( 1 ) 

N, number of prior reinforcements 
Cd, drive condition 
S, stimulus intensity 
to , amount (weight) of reward 
s'Hr, strength of a habit based on same 
response conditioned to another stim- 
ulus 

W, work required in responding 

Column (2) 

&Hk, habit strength 
D, drive 

V, stimulus-intensity dynamism 
K", incentive motivation 
sHb, generalized habit strength from re- 
lated habit 


Ib, reactive inhibition 
sin, conditioned inhibition 

Column (3) 

sEr, reaction potential 

sEn, generalized reaction potential 

sin, aggregate inhibitory potential 

Column (4) 

sEr, net reaction potential 

sOr, oscillation of reaction potential 

sLr, reaction threshold 

Column (5) 
stn, reaction latency 
A, reaction amplitude 
n, number of non-reinforced responses to 
extinction 


only when conflicting responses were involved, and hence probability is 
inappropriate as a measure from which to infer sE^. 75 

The diagram falls short in not listing the several important stimulus 
components making up the complex of traces present when a response is 

75 The original probability postulate occurs in Hull ( 1943), page 344. The discussion 
of reasons for withdrawing it may be found in Hull (1951), page 126. Corollary xiv 
permits use of the empirical data formerly included in the probability postulate. 
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evoked, important among which are the drive stimulus (S D ) and the 
fractional antedating goal stimulus ( s G ). The kinds of items serving as 
components of the stimulus trace are listed in Table 2. Because these 
components are not all contemporary in origin, some of them representing 
the subsiding residues of just prior stimulation, others (on the basis of 
acquired antedating responses) representing events still to come, the 
complex pattern of stimuli makes possible the linking together of response 
patterns covering large segments of time and space. The stimulus com- 
plex contains traces that both activate behavior and guide it. 


TABLE 2, The Components of a Stimulus Trace 


Origin of Stimulus-Trace 
Component, with Symbol 

Resulting Component of 
the Stimulus Trace 

Environmental stimulus .... S 
Compound stimuli Si, S a ... 

Drive D 

Response R 

Fractional antedating goal 
response r G 

s Molar afferent impulse 

s Molar afferent impulse modified by 

afferent interaction 

S D Drive stimulus 

Proprioceptive consequence of re- 
sponse 

s G Fractional antedating goal stimulus 


DERIVED INTERMEDIATE MECHANISMS 

Hull’s system is characterized by him as molar, by which he means 
non-physiological. That is, the inferences to intervening variables do not 
depend upon the discovery of the physiological substratum which is causal 
in a “molecular” way. The assertions about molar stimulus traces, or af- 
ferent interaction, need not include specification in terms of the con- 
stitution of nerve bundles or the characteristics of the nervous impulse. 
Hull vacillated somewhat by including physiological language in his postu- 
lates. In any case, Hull’s system classifies as a reductive system, in that 
more complex phenomena are deduced on the basis of presumably simpler, 
more basic phenomena and relationships. In this sense the more complex 
are “reduced” to the simpler through analysis. It is characteristic of all 
systems of this kind that they intend to explain behavior that is super- 
ficially unlike the behavior from which the postulates are derived. In other 
words, Hull did not intend merely to systematize the account of rat lever- 
pressing, from which most of the data for his later set of postulates derived. 
He intended to arrive at the basic laws of behavior, at least the laws of 
the behavior of mammalian organisms, including the social behavior of 
man. 

In order to bridge the gap between the simple laboratory experiments 
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furnishing the constants in the postulates and corollaries, and the more 
familiar behavior of organisms adapting to a complex environment, he 
derived intermediate mechanisms . Once these mechanisms were available, 
they opened the way to the explanation of many more varieties of behavior. 
We shall consider two of these mechanisms: the gradient of reinforcement 
(originally the goal gradient), and the habit-family hierarchy. 

We have to distinguish between the historical and the logical order 
of postulates and intermediate mechanisms. Many of Hull’s most brilliant 
deductive accounts of complex phenomena were made in early papers, 
prior to the postulate systems. 76 Many of these accounts were essentially 
unchanged when they were reworked for the 1952 book, even though 
in the meantime the postulate system underwent substantial revision. 
From a logical point of view, however, once the system has been con- 
structed , the postulates take priority. The same applies, of course to 
predictions. An experimental fact may be “predicted” by a system, even 
though the fact has long been known. The only advantage in predicting 
new (unknown) facts is that, in predicting them, there is less temptation 
to include the facts in the postulates used as a basis for their prediction. 

\ / 

The Gradient of Reinforcement ( the Goal Gradient) 

There are two chief kinds of time gradients involved in conditioning 
experiments. The first (found primarily in classical conditioning) is based 
on the time between the conditioned stimulus and the unconditioned 
stimulus, short forward intervals being found most satisfactory. Hull took 
this interval into account in developing his postulate regarding the stimulus 
trace ( Postulate 2B ) . The assumption is that conditioning is actually simul- 
taneous, but the gradient is produced by a rising and falling molar trace 
set into motion by the external stimulus. The second kind of time gradient 
(found chiefly in instrumental conditioning) is based on the time between 
the response to be strengthened and the reinforcement. This second gradi- 
ent is considered in the corollary concerned with delay in reinforcement 
(Corollary iiiB). As pointed out earlier, the simple gradient (a single 
response reinforced) is assumed to extend for about 5 seconds; if a rein- 
forcement is delayed more than 5 seconds, its effectiveness must depend 
upon other mechanisms. 

The measurement of the gradient of reinforcement depends upon 
experiments of the instrumental conditioning type, in which an act such 

76 For an introduction to the deductions available before 1940, see the summaries in 
Hilgard and Marquis ( 1940) : maze learning, 216-221; serial verbal learning, 221-226; 
reasoning experiments, 236-241; circumventing a barrier, 242—243. 
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as pushing a lever or running down an alley leads to the goal-object and 
reinforcement. Delay may be introduced between the act to be rewarded 
and the goal-object which provides the reinforcing state of affairs. Thus 
the delivery of the pellet of food which is the reward for lever-pressing 
may be postponed for an arbitrary number of seconds in order to study 
the effect upon rat learning of such a delay. The gradient of reinforcement 
is derived from such delayed reinforcement experiments, as distinguished 
from the stimulus-response separation experiments of classical condition- 
ing. Hull at first believed, on the basis of a delayed reinforcement experi- 
ment done with rats in the lever-pressing box, that the basic gradient of 
reinforcement was fairly short, in the case of the rat, "possibly no more 
than thirty seconds and very probably less than sixty seconds.” 

Other experiments on the effect of delayed reward, such as those of 
Wolfe (1934), show that in a simple alley maze the reward is effective 
for learning with much longer delays, at least as long as 20 minutes, the 
longest delay tested. A gradient is shown here, also, with the longer in- 
tervals less effective for learning than the shorter ones. 

Let us see how these gradients are related, before we turn to the 
manner in which they are used to derive other forms of behavior. The 
final theory somewhat unclearly implied four steps: 

1. The influence of reinforcement is solely upon the immediately 
concurrent stimulus and response units. There is no genuine primary gradi- 
ent of reinforcement. In this Hull follows Spence (1947), and he has 
eliminated delay in reinforcement from his postulate set. 

2. For a single stimulus-response conjunction followed by delayed 
reinforcement there is a short gradient of reinforcement, extending about 
5 seconds. This is expressed in Corollary iiiB. The delay is derived on the 
basis of r G and its s G , especially via the secondary reinforcement role of s G . 

3. For links in a chain of responses, with terminal reinforcement, the 
links farther removed from reinforcement are less strongly reinforced. 
This is a longer gradient than that for the unchained single response, and 
it depends upon the secondary reinforcement provided by the enviro nm ent 
and by the organism’s own movements. /This is the principle to account 
for the gradients in minutes rather than in seconds. It is described qual- 
itatively in Corollary iiiA, but is not derived. While later it is treated 
quantitatively, the quantities are assigned on the assumption that the 
gradient exists in appropriate magnitudes. Nowhere is it linked quanti- 
tatively with the gradient of Corollary iiiB just described. 

4. In any actual chain of responses as observed and recorded, the 
gradients of reinforcement affecting individual links in the chain summate 
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in complex fashion to produce empirical gradients 77 bearing little re- 
semblance to any single inferred underlying gradient of reinforcement. 

To be strictly elegant in system construction, we ought to be able to 
move by formal steps from each one of the foregoing numbered proposi- 
tions to the next. We can follow Hull's attempt to get from (1) to (2), 
though his derivation involved outmoded features of his system. 78 He 
moved from (2) to (3) very informally, without attempting to construct 
a sound bridge between them. He made a more serious, and more success- 
ful, effort to move from (3) to (4). To that step we now turn. 

This is our first illustration, in any detail, of how Hull proceeded 
to use an intermediate mechanism in accounting for concrete behavior in 
a complex situation. Beginning with the assumption of the long gradient 
of reinforcement of point (3) above (an assumption already involving 
many steps from the basic postulates), the problem is to account for the 
actual latencies of response at each step of a four-response chain, leading 
to reinforcement at the end. 

Let us begin with the empirical data that we wish to derive. They 
come from an experiment by Arnold. 79 A rat learns to press a button 
presented outside the window of its restraining cage, whenever the shutter 
is opened. Pressing the button is rewarded by a pellet of food. Now the 
experiment proper begins. A clever device presents to the rat a chain 
of four stimulus-response possibilities by means of a car running by his 
window on a track. In this experiment, there are four parts of the car, 
all alike, all with buttons exactly like the one he already learned to press. 
When the shutter is opened, button B x is presented. He presses it, and 
starts the car in motion, though no pellet appears. Three seconds later the 
second button B 2 has come into view as the car stops with it before the 
window. He presses it; now button R a appears. In three more seconds he 
has the opportunity of pressing B 4 . Because the response was well learned, 
there is not enough extinction in these few non-reinforced trials to pre- 
vent the response. Having made the four responses in order, the shutter 
falls, and the food reward appears, giving terminal reinforcement to the 
whole linked series. The series of events is presented once a day, and a 
measure of reaction latency taken at each button. The results for trials 

77 The term “empirical gradient” is a convenient term for the gradients actually 
found. At one time the distinction was made between the gradient of reinforcement 
as the gradient of habit strengthening, and the goal gradient as the effect upon per- 
formance, a distinction first suggested by Miller and Miles (1935). But when the 
gradient of reinforcement came itself to be derived rather than basic, the former 
distinction between it and the goal gradient was no longer maintained by Hull. 

7 8 Hull ( 1952a), pages 127-128. 

79 Arnold ( 1947). Additional experiments in the series were also considered by Hull. 
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2-10 are shown in Figure 16. While there is a kind of gradient of rein- 
forcement, with latency getting shorter nearer to the goal, there is a 
striking upturn at B 4 . The theoretical problem is to derive this empirical 
gradient from the overlap of gradients, all of which fall off from the point 
of reinforcement. 



0 i n m - ' 

ORDER OF MANIPULANDA PRESENTATION 

FIGURE 16. Reaction latency of responses to each of the four stimuli in a be- 
havior chain with terminal reinforcement. Adapted from Arnold (1947) mee 356 
and Hull (1952a), page 164. h r & ooo. 

The steps involved are as follows: 

1. On the basis of prior learning, there is the original S E R available 
equally to B„ B,, B s , and B 4 . Some initial value must be assumed. (Hull 
arbitrarily assumed 2.0 <r.) 

2. The general form of the gradient of reinforcement with time is 
assumed, and both its form and the number of trials required to generate 
it are arbitrary, though the form is consistent with gradients of gener- 
alization. The resulting increments of reaction potential at each of the 
stimuli turn out to be: B 1} .234; B 2 , .380; B 3 , .617; B 4 , 1.000 u. This conforms 
to our expectation of greater gains nearer to reinforcement, because rein- 
forcement follows B 4 . 

3. The original s Eb (2 tr) and the increment through reinforcement 
must be added for each link in the chain, according to the principle by 
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which reaction potentials are combined. The sum will, of course, be less 
than the arithmetical sum. The result turns out to be: B u 2.16; B 2? 2.25; 
Bjj, 2.41; B J? 2.67. These values Hull calls the g mclient-of-reinforcement 
values in this context, because they are the values at each link before we 
consider the interaction of the different links. 

4. We must now take into account the fact that a stimulus does not 
cease to operate when it is withdrawn, but its trace persists. Hence that 
persisting stimulus trace will derive reinforcement from each of the sub- 
sequent secondary reinforcements along the route to the goal, and through 
generalization the effect will be felt at the next onset of the stimulus. 
Similarly, each response along the way contributes strength to the links 
preceding it (whose traces are represented when it is being reinforced) 
and to the responses following it (because its stimulus trace will be part 
of the stimulus complex when these succeeding responses occur). This 
two-way contribution of each link to the others is computed, again through 
arbitrary formulas whose exponents have no relationship quantitatively to 
those given in the postulates, though the form of the equations is as 
postulated. The results are given in Table 3. 

TABLE 3. Steps Used by Hull in Computing the Theoretical Mean Reaction Laten- 
cies at the Response Points of a Four-Link Response Chain. The gradient of rein- 
forcement values, from which the remaining values in the table are generalized, are 
shown in bold type. See text. From Hull (1952a), page 162. 




Components of Reaction Potential 

( S Er) 


Assumed d values 
( in j.n.d/s ) 

Response number 

Bi 

4 

B 2 

2 

1 

B 3 

Bi 

Values based on delay 
in reinforcement 

9 seconds delay 

2.16 

.14 


.03 

.02 

6 seconds delay 

.57 

2.25 


.57 

.28 

3 seconds delay 

.30 

1.21 


2.41 

1.21 

0 seconds delay 

.24 

.95 


1.89 

2.67 

Behavior sums ( 4- ) of «E e 
Reaction latencies ( S £ E } 

2.82* 

3.54* 


3.78* 

3.46* 

3.51" 

2.19" 


1.91" 

2.30" 


5. Now we add up the various components of reaction potential at 
each link in the chain. The addition is again by formula, and not straight 
arithmetic. The sums are given in the row second from the bottom of 
Table 3. 

6. The final step is to convert these sEr's to latencies ( sfiis ) . Here 
again an arbitrary formula, with new constants but familiar form, is used, 
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and the latencies computed, as shown in the bottom row of Table 3. This 
is the end of the determination, and the results are plotted in Figure 17, 
for comparison with Figure 16. 

The general agreement between the two figures is striking enough to 
indicate Hull's ingenuity. There is some disagreement in the absolute 
magnitudes, however, and this led Hull to assert: 

This degree of agreement between experiment and theory is, perhaps, as 
close as may reasonably be expected in the present early stage of the science. 80 

The agreement lies in the form of the function. The failure lies in 
the latency measure — a matter of really great importance to the system 
because of the reliance on latency for quantifying S E B . Hull had earlier, 



FIGURE 17. Theoretical reaction latency of responses to each of the four stimuli 
in a behavior chain with terminal reinforcement. Compare with the empirical results 
shown in Figure 16. Adapted from Hull (1952a), page 163. 

in choosing the equation for the computation of latency, multiplied the 
values derived from Postulate 14 by an arbitrary constant (3-f-) because 
he noted that the values would otherwise have been too small, 81 then two 
pages later he finds that he had not boosted the latency enough. He could 
easily have chosen a constant to make the values correspond much more 

Hull (1952a), page 165. 

81 Hull (1952a), page 163. 
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closely, by multiplying by 50 instead of multiplying by 3. Possibly he 
worked the matter out independently, and then felt it would be deceptive 
to work backwards from the result, or thought a 50-fold correction was 
too great. Because the constants are arbitrary anyhow, he might as well 
have worked backwards, and achieved as good a fit as possible. Had the 
constants been independently determined, there would have been some 
point in seeing how much discrepancy remained. As it stands, the pre- 
diction of the form of the function is an achievement. Everything else 
is arbitrary, and consequently the achievement itself has much that is 
ad hoc in it. 

The main point of this detail with respect to the derivation of the 
gradient for a chain of responses has been to point out how the derivation 
depends primarily upon the intermediate mechanism (the gradient of 
reinforcement), but then uses other principles (e.g., generalization, sum- 
mation of reaction potentials, conversion of reaction potential to latency) 
to achieve a predicted result, an empirical curve that bears little resem- 
blance to the principles used in its deduction, but corresponds closely in 
form to the results of a complex learning experiment. The deduction 
points out also how deceptive is the apparent quantitative rigor of the 
system. The quantities in the postulates are mere illustrations of 
quantities, and are not, in fact, made use of in any consistent manner in 
later derivations. 

While the gradient of reinforcement is not the primary one it was 
earlier considered to be, it is a derivative or intermediate principle, which, 
once established, again serves its former purposes in the explanation of 
more complex learning. The original application was to the maze. The 
principle mediated the deduction that responses nearer to the goal would 
be more strongly conditioned than those farther removed, so that short 
paths would be preferred to longer ones, blinds near the goal would be 
eliminated more readily than blinds farther away, longer blinds would 
be more readily eliminated than shorter ones, and so on. 83 The goal 
gradient principle was later applied to field-force problems as studied by 
Lewin. 83 For example, in experiments involving the circumventing of 
barriers between the learner and a visible goal, Hull proposed that the 
reaction to the perceived goal-object should behave in accordance with 
the goal gradient That is, the nearer the learner came to the goal, the 

82 Hull (1932). For an introductory exposition, see Hilgard and Marquis (1940), 
pages 216-221. For the earlier history of the problem of the goal gradient, see Hull 
(1943), pages 159-160. 

83 Hull (1938). The problems were such as proposed by Lewin (1933a) (1935). 
The deductions are repeated with little change in Hull (1952a), pages 262-268, 
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stronger should be its response-evoking power. Thus Hull, by way of 
the gradient of reinforcement, came to conclusions similar to those de- 
scribed by Lewin as goal-attraction in relation to distance. 

The Habit-Family Hierarchy 

A second derived principle is that of the habit-family hierarchy. 
This is not included among the postulates because, like the gradient 
of reinforcement, it is a principle at intermediate level, being itself de- 
rived from more basic principles. It carries great weight, however, in 
the deduction of further behavioral phenomena. 

Because there are multiple routes between a starting point and a 
goal, the organism learns alternative ways of moving from a common start- 
ing point to a common goal-position where it finds need satisfaction. 
These alternatives constitute a habit family because of an inferred inte- 
grating mechanism. The integration into a family is by way of the frac- 
tional antedating goal reaction , present as each alternative is active. 
The fractional anticipatory goal reaction provides a stimulus (sq) to 
which all overt responses are conditioned. Through the differential action 
of the gradient of reinforcement, some responses are less strongly con- 
ditioned to s G than others. The starting responses of longer routes, for 
example, are more remote from reinforcement than the starting responses 
of shorter routes. Hence the latter are more strongly reinforced, and 
more strongly conditioned to s G . As a consequence, the alternative be- 
havior patterns are arranged in a preferred order. The less favored routes 
are chosen only when the more favored are blocked. It is this set of al- 
ternative habits, integrated by a common goal-stimulus, and arranged in 
preferential order, that constitutes a habit-family hierarchy . 

It is further deduced by Hull that if one member of a habit-family 
hierarchy is reinforced in a new situation, all other members of the 
family share at once in the tendency to be evoked as reactions in that 
situation . 84 This makes possible the explanation of response equivalences 
and other appropriate reactions in novel or problematic situations, such 
as those found in insight and reasoning experiments . 1 

The principle was first applied to maze learning , 85 serving chiefly to 
explain the tendency for the rat to enter goal-pointing blinds, even though 
such entrances had never been reinforced in the maze situation. Goal 
orientation was taken to represent an inappropriate transfer of spatial 
habits acquired in free space. Another application was in relation to the 

84 Formal deduction was first given by Hull ( 1937), page 27. 

85 Hull (1934a). 
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detour experiments. 86 The difficulty of turning away from a perceived 
goal beyond a barrier depends on the presence of habit-family hier- 
archies as well as upon goal gradients. In the usual experience of free 
space, the favored path is the straight line between the learner and the 
goal. The next-favored starting response is that making least angle with 
the goal. The greater the angle, the less favored is the starting response 
in that family of habits built up in previous experience. Hence, when 
blocked, the learner prefers a path which goes off at a right angle to one 
which requires that he turn his back on the goal. In some objective situa- 
tions he may come to choose a longer path to a shorter one, if the habit- 
family hierarchy proves to be misleading. 

SOME REPRESENTATIVE DEDUCTIONS AND THEIR 
EXPERIMENTAL TESTS 

We have met one illustration of Hulls deductive procedure in his 
computation of reaction latencies at each link of a four-link reaction 
chain with terminal reinforcement. His system commanded its initial at- 
tention because of his skill in carrying out corresponding computations 
in a variety of fields of experimentation. Summaries of many of his 
earlier deductions are already available. 87 A number of these deductions 
were repeated in the final book, with moderate changes to conform to 
the new system. The three illustrations chosen for discussion in what fol- 
lows are, in substance, new to the final book (response alternation in 
trial-and-error learning, latent learning, and adient-abient conflict). Al- 
though they represent but a small sample of the total available deduc- 
tions in the final system, they will serve to demonstrate Hull’s method of 
arriving at new phenomena with the help of familiar postulates. 

Response Alternation in Trial-and-Error Learning 

Thorndike was impressed by the gradualness of trial-and-error learn- 
ing. The gradualness implied that a response, once correct, was not con- 
sistently repeated thereafter, and that an incorrect response, once elimi- 
nated, must have cropped up occasionally. Hull attempted to come to 
closer grips than Thorndike did with this problem of response alterna- 
tion in trial and error. 

It will make the problem more concrete to begin first with the em- 

86 Hull (1938). 

87 In addition to the summaries on serial learning and problem-solving in Hilgard 
and Marquis (1940), on rote memorization and related phenomena, see the first edition 
of this book, Hilgard (1948). 
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pirical results, and then to turn to the deduction. In one of his own 
experiments Hull (1939b) used a modified Skinner box, as sketched in 
Figure 18. The experimenter could present either the vertical or the 
horizontal bar alone, or both together. The rat first learned to operate 
the vertical bar alone, and continued to operate it for 15 food-reinforced 
responses. The following day the experimenter presented only the 



FIGURE 18. Diagrammatic representation of box used to study trial-and-error in 
the two-lever problem. Either the horizontal bar (H) or the vertical one (V) could 
be made positive; that is, when tire appropriate bar was pressed a pellet of food fell 
through the tube (T), into the food-dish (R). After Hull (1939b), page 237. 

horizontal bar, and the rat learned to operate it for 60 food-reinforced 
responses. The crucial test came the next day. The experimenter first 
allowed a warm-up by permitting four reinforcements of the already 
strong horizontal bar habit. Then he presented both bars at once, but set 
the apparatus so that only response to the vertical bar led to reinforce- 
ment This became a simple trial-and-error experiment, requiring a re- 
versal from the stronger and more recently reinforced to the weaker and 
earlier reinforced habit. 

All 76 animals that finally mastered the trial-and-error learning met 
the criterion of learning gradually, responding now to the horizontal bar, 
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now to the vertical bar. Sometimes the rat would respond to one bar 
for several trials, then respond to the other for a run of trials. Hull 
grouped these vacillating runs into what he called alternation cycles. 
He defined a cycle as including two phases, a run of responses to the 
vertical and a run of responses to the horizontal, such runs being pre- 
ceded and followed by changes of response. Thus the cycle lasts from a 



ALTERNATION CYCLES 


FIGURE 19. Responses per alternation cycle in simple trial-and-error learning 
when the incorrect response was originally more strongly conditioned than the cor- 
rect one. Data from Hull ( 1939b), as plotted in Hull (1952a), page 52. 

shift in a given direction (say from horizontal to vertical) through the 
shift in the opposite direction (from vertical to horizontal) and con- 
tinues until the original shift occurs again. This corresponds to the usual 
definition of a two-phase cycle. The length of the cycle is determined 
by the number of responses in the runs making up the two phases of 
the cycle. 
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In order to exaggerate the effect somewhat, Hull selected the data 
from the 25 animals (among the 76) which yielded from four to six com- 
plete response-alternation cycles. His plot of the number of responses 
per alternation cycle for the first four cycles is reproduced as Figure 19. 

Can such a curve be predicted on the basis of the postulates of the 
system? Here is how Hull went about it. 

He first made use of an earlier derivation of the course of a proba- 
bility-of-reaction curve for learning under these conditions. His earlier 
computations had resulted in a table of reaction-probabilities of response 
to the positive and to the negative stimulus on successive occasions. First 
the reaction potentials to the positive and negative stimuli were com- 
puted in cr units. "The probability of a response occurring to one or the 
other of the stimuli depends upon the amount of separation of these two 
reaction potentials, because they are in conflict. Because the potentials are 
based upon o- units, he could use the familiar formulas for significance of a 
difference and determine a critical ratio, d/a&. The probability of reaction 
can be determined by the probabilities associated with this ratio. The 
result of this earlier computation was a sigmoid curve of acquisition of 
the positive response. 88 

While Hull seems here to be making simply a familiar statistical 
computation, the psychological (and systematic) assumptions underlying 
it are consonant with the assumptions used in scaling S E K , because the 
a unit used in the computation is the inferred behavioral oscillation 
(sOk). 

From the values entering into the probability-of-reaction learning 
curve, he moves on to response alternation without additional psycho- 
logical or systematic assumptions. The next steps are purely statistical, 
and depend solely upon principles of probability. The principles are 
used with considerable ingenuity to derive the curve that is plotted as 
Figure 20. What this curve says is that response cycles will at first be 
relatively long (consisting mostly of the negative responses, not yet ex- 
tinguished), that they will fall to some minimum length, and then in- 
crease in length again (at the end consisting largely of positive responses, 
now overcoming through reinforcement their original handicap relative 
to negative responses)! 

Hull has now, by way of a number of steps consistent with his 
theory, supplemented by probability considerations, derived a theoretical 
curve that shows a strong family resemblance to the empirical curve of 
Figure 19. The abscissa units are not alike, and precise comparison is not 
88 Hull ( 1952a), pages 30-33. 
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possible; but the resemblance is strong, and the deduction is in this 
case an unusually rigorous one. 

The plotting o£ response-alternation cycles exhibits another of Hull's 
strengths — an ingenuity he showed in discovering or inventing new ways 
to summarize the data from familiar types of experiment. The theoretical 
basis for the curve shown in Figure 20 permits other deductions under 
different conditions of experiment, so that this figure is but one of a 



FIGURE 20. Theoretical responses per alternation cycle when the incorrect re- 
sponse was originally more strongly conditioned than the correct one. Compare with 
empirical curve of Figure 19. After Hull (1952a), page 48. 

family of curves. Mathematical formulation has the advantage that shifts 
in the form of functions can be predicted when the parameters change. 
These shifts are often very different from those that might be guessed or 
intuited. If they are then confirmed experimentally, the theory receives 
a genuine support. 

^ Latent Learning 

Sometimes, when a new incentive is introduced, animals perform- 
ing at a very mediocre level without the incentive make sudden gains, 
bringing them abreast of animals trained all along with the favored in- 
centive. Because the new incentive seems to bring out concealed or 
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unused learning, experiments of this kind, when successful, are said to 
reveal latent learning. Such experiments were formerly a source of con- 
troversy between reinforcement theorists and cognitive theorists. The 
cognitive theorists cited these experiments in support of their position 
and pointed out the difficulties for the reinforcement theorist.Tn the type 
of theory Hull held in 1943 they were indeed difficult to account for, 
because S H R then depended upon the magnitude of the reinforcement. 
,With the change in emphasis from S H R to S E R , Hull no longer had diffi- 
culty with latent-learning experiments and proceeded to deduce their 
results as consequences of his new theory.]/ 

It is doubtful that Hull deliberately shifted his system in order to 
account for latent learning, for the facts of latent learning were at the 
time very much in doubt, and his followers were still very critical of the 
results of earlier latent-learning experiments. Another possibility is that 
the shift from S H R to S E R occurred because of Hull’s preoccupation with 
quantification. He found that he could quantify S E R , but was far from 
quantifying S H R . Hence it was preferable to write more equations in 
terms of S E R . 89 

The method by which Hull derived latent learning is simplicity 
itself. x So long as there exists a modicum of reinforcement, g H R builds 
up as a function of the number of trials. The revealed g E R will be small, 
and the revealed learning slight, because one of the multipliers (K, the 
incentive reinforcement) is small. Now when K is made large, through 
the introduction of the major incentive, we have a new multiplier, and 
the amount of S E R (for the same S H R ) will be much greater. This is 
largely what is meant by latent learning: a sudden gain in S E R with the 
introduction of incentive.'' 

Hull’s deduced latent learning is shown schematically in Figure 
21. Shifts of incentive, either upwards or downwards, within a few trials 
place the organism on the learning curve characteristic of animals trained 
all along with the new incentive/ Hull accepts the familiar latent-learning 
data of Tolman and Honzik (1930) as supporting his prediction. 00 

\ 

Adient-Abient Conflict ) 

The postulates and corollaries include references to anticipatory goal 
responses and to a factor (K) having to do with the magnitude of in- 
centives used in prior reinforcements. They assert nothing, however, about 
perceived incentives. Hence Hull saved the problems of perceived in- 

89 This is also Koch's interpretation, Koch ( 1954), page 106, footnote 29. 

90 We shall meet latent learning again in the next chapter. 
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centives to be treated through theorems derived from the more basic 
postulates. 

Perceived incentives are very important in ordinary behavior. Con- 
sider a cat lying in wait to pounce upon a bird, or a buzzard circling 
in search of carrion, or a male dog attracted by a female in heat. The 
ongoing behavior is very much influenced by the perceived incentive. 



FIGURE 21. Hull’s theoretical prediction of latent learning. The upper curve rep- 
resents the course of learning with a large incentive ( K = .9 ) throughout. The lower 
curve shows the course of learning with a smaller incentive (K = .6) The first curve 
that crosses between the two predicts what will happen if the incentive is increased 
at trial 10 for animals previously trained with the smaller incentive (the typical 
latent learning result). The next two crossings represent the result of a decrease in 
incentive at trial 15, and an increase at trial 20. After Hull (1952a), page 144. 


"Perceived incentives are of two kinds: those, such as food, which in- 
vite approach; and those, such as noxious stimuli, which invite avoidance. 
Hull, selecting some older terms, called approach behavior adience or 
adient behavior , and withdrawal abience or abient behavior . 

Hull takes the major steps from the postulates to the basic theorems 
of adient and abient behavior informally, that is, without quantitative 
reference to the postulates (except for similarities in the forms of func- 
tion chosen). In the first step, he accepts it as plausible that 'approach 
responses to an attractive incentive near at hand have been conditioned 
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according to familiar principles. The same is implied, but in reverse, for 
withdrawal responses. At a distance, an incentive object, whether adient 
or abient, will produce a stimulus pattern resembling the one close to the 
object, so that responses conditioned to the object itself will be condi- 
tioned to the object at a distance, 'at an intensity corresponding to the 
difference between the stimulus patterns for the near and far object. 
Thus the familiar generalization gradient applies. It is further assumed, 
on the basis of these considerations, and because locomotion is itself a 
highly generalized kind of behavior, that both ’adient and abient behavior 
will be highly generalized in respect to both direction and distance. 
Through these informal steps, largely by analogy with matters dealt with 
in the postulates and corollaries, bull succeeded in providing for the 
kinds of conflicts in free space that Lewin treated under his concepts of 
“valence,” an adient goal being ( for Lewin ) positively valenced, and an 
abient goal being negatively valenced. 

Now we come to the quantitative portion of the deduction. For this 
purpose we take a special problem of adient-abient conflict, best illus- 
trated in the experiments of J. S. Brown (1942) (1948) and the related 
experiments as reported by N. E. Miller (1944). 91 Brown determined two 
points on a gradient of adient behavior, and two points on a gradient of 
abient behavior, as illustrated in Figure 22. A hungry rat ran down an 
alley to receive food at the end, the food location being distinctively 
marked with a light. The animals wore a little harness attached to a cord. 
The harness did not restrain them during their learning, but in testing 
they could be restrained by the cord, and the strength of pull measured. 
This is the pull in grams as recorded in Figure 22 at the two points of the 
approach gradient Other animals, instead of receiving food at the end 
of the alley, received electric shock. When restrained near the point of 
shock, they pulled hard to get away from the shock zone, but if restrained 
at a distance they pulled away only slightly. (These two points make up 
the avoidance gradient, which is seen to be much steeper than the ap- 
proach gradient! Miller and his associates predicted that if animals were 
both fed and shocked at the end of the alley, they would be placed in 
conflict. When placed at the end of the alley farthest from the ambivalent 
goal, they should approach part way and then stop, the point of stopping 
being related to the point at which the two gradients cross, because here 
the opposite pulls are equal. By strengthening the hunger or weakening 
the shock, the animals could be made to approach nearer to the goal 

91 The important evidence for the theorem under study is from an unpublished study 
by Miller, Brown, and Lipofslcy ( 1943), discussed by N. E. Miller { 1944), page 437. 
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before stopping. Hence the experimental results agreed with the inter- 
pretation that the two independent gradients measured by Brown would 
interact when both food and shock occurred together. 
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FIGURE 22. Empirical form of approach and avoidance gradients. The straight- 
line gradients are based on two points determined by J. S. Brown for each gradient 
separately, as reproduced by Miller (1944), page 434. The theoretical point of equi- 
librium has been added. 


Now let us see how Hull, with these facts before him, ties them into 
his system. This is a good illustration of prediction after-the-fact, but 
such prediction is, of course, entirely permissible in a deductive system. 
The theorem with which Hull comes out is stated as follows: 02 


Theorem 86. Other things equal, with moderately sophisticated subjects, 
when an adient object and an abient object occupy nearly the same point in space 
and the maximum abient reaction potential is greater than the maximum adient 
potential, there will be a point of stable equilibrium at a j.n.d. distance from 
the adient-abient object amounting to 


sE' r 


d = 


^&10 jg 


f-1 


Miller s conclusion is thus given a precise mathematical form. The 
derivation here will serve to provide a step-by-step illustration of the 
method Hull used much more generally. 

First, for the straight-line gradients of Figure 22, Hull substituted 


Hull (1952a), pages 249-250. 
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the curved gradients of Figure 23, in order to use curves of the same 
form that he used in generalization gradients (Postulate 10). The adience 
exponent he made half that of the abience exponent, simply because 
Brown and Miller had found the two gradients to differ in slope. 

This tampering with the exponents, in order to agree with the results 
of experiments, is allowable, but it should follow some of the rules of 
the system. In strict logic, because Hull does not deduce these differences 
in exponents, he must postulate them. Therefore he has added a new 
but unacknowledged postulate to his system at this point (until such 
time as the differences are derived from more fundamental principles). 



DISTANCE FROM POINT OF REINFORCEMENT (j.h.d.s) 

FIGURE 23. Theoretical adient and abient gradients used by Hull in predicting 
the point of equilibrium in an adient-abient conflict. Compare with the empirical 
results of Figure 22. Replotted from Hull (1952a) to conform with the conventions 
of Figure 22. 

The point of equilibrium in Figure 23, like that in Figure 22, is the 
point at which the two curves cross. What Theorem 86 states is merely 
the algebraic solution of the problem solved graphically in Figure 23. 

For adient behavior, we assume the gradient to be of the form: 

bEb = bEeX10-* (1) 

Where S E R is the generalized reaction potential at any point on the gradient at 
a distance d in j.n.d/s from the point of reinforcement. 

The corresponding gradient for abience is: 

sE'r = s E' r X 10-i'tf 


(2) 
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To derive the equation required for the theorem we find that value of 
d at which the two gradients will balance each other, where S E R = S E' R . 
This is a matter of simple algebra, which looks more forbidding than it 
is partly because the subscripts S and R are appendages without any 
mathematical meaning. 

We know that, at the value of d which we seek 


S Er sE'r 

(3) 

Hence, from (1) and (2) 


s E It X 10“ w = S E' U X 10-^ 

(4) 

Eliminating the negative exponents, we may write 


sEr _ 

10** I0)' d 

(5) 

Transposing terms, we have 


10* s E' r 

10* _ s e r 

(6) 

Taking the logarithm of both sides, 


fd-jd = log. jf®- 

S&R 

(7) 

Solving for d , the value we are after, 


t sE'r 

1o §io t? 

d= v sEk 

f~\ 

(8) 


Equation (8) is the one appearing in the theorem. It results solely 
from selecting as the form of the gradient the form earlier adopted for 
generalization gradients, with the added assumption (similar to Miller’s) 
that there is an equilibrium point where the two gradients coincide. 

To prepare a plot as in Figure 23, it is necessary to make computa- 
tions, using actual values for the terms entering into equation (8). Let 
us see how Hull did this. He assumed the following values, which, while 
arbitrary, are of an order of magnitude used elsewhere in the system: 

S E n — 4 a I 'pkg g rea {er magnitude of the abient reaction potential S E' R is 

tji of assumed in Theorem 86. 
s Jii R — cr J 

= .02 \ The greater steepness of the abient gradient (/' > j) is assumed to 

= .01 J be a fundamental characteristic. 
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Substituting these values in Equation (8), we have 

log log 1.333 

d ~ .02 - .01 “ .01 

Looking up the value of log 1.333 in a table of logarithms, we find 
log 1.333 = .1248, hence 

, .1248 ,, 

a = — = 12.48 ].n.d. s 

By examining Figure 23, we see that the algebraic and graphical 
solutions agree (as indeed they should!). 

What do we learn from these computations? When we follow Hull’s 
computational technique, we find that very often the results are obtained, 
as in this example, through assigning equations to simultaneous functions, 
and then solving the simultaneous equations. The tie-in with the total 
system is often very slight. In this case, the forms of the gradients are 
the same as those used in generalization gradients, and the constants 
are similar in meaning. But the accuracy of the deductions does not de- 
pend upon these similarities, and, in fact, some of the correspondence 
between the system and the deductions is purely gratuitous. For ex- 
ample, no operational meaning is given to a difference in j.n.d/s along 
a gradient of this kind, so that there is no check whatever on the use of 
these units. An accurate use of empirically defined j.n.d. units might very 
well change the slope of the gradient. Furthermore, the puzzling prob- 
lems of the relative effectiveness of reward and punishment, which ought 
to be dealt with as central features of a reinforcement theory, are here 
glossed over by the very molar assumption that the exponent for the 
gradient of abience is double that for adience. For a genuinely precise 
deduction it would be necessary also to show that strength of pull when 
restrained has a determinate relationship to reaction potential as quanti- 
fied in the system. Despite the appearance of precision, and of tie-in 
with his system, Hull’s deduction is by no means rigorous and represents 
little, if any, advance over the simpler presentation of Figure 22. 

Proceeding in somewhat similar manner, now that the general prin- 
ciples of adience and abience were accepted and in some sense justi- 
fied, Hull was able to combine the principle of the habit-family hierarchy 
with that of the gradient of reinforcement to explain the attraction of 
goals beyond barriers, and of the paths taken around such harriers. These 
are the kinds of problems in open space emphasized in field theories such 
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as Lewin’s. Hull was willing to go so far as to borrow portions of the 
physical analogies used by the field theorists: 

Assuming as a first approximation that behavioral vectors in quite naive 
subjects operate roughly as physical vectors, even a slight movement to one side 
of the line will unbalance the otherwise completely opposed reaction tend- 
encies. . . , 93 

Because adience and abience operate in two-dimensional space, Hull 
acknowledged that he had developed a field theory of sorts: 

The theory of adient and abient behavior thus involves examples of bona 
fide field theorv though this theory must not be confused with physical field 
theories, from wire'. the present theory differs in most respects. 94 

Hull referred to two chief differences between his kind of field theory 
and physical field theory: first, the organism is not propelled, as a 
physical particle is, by external forces from the field, but the energy 
for its movements comes primarily from the food it eats; second, the 
distances important in physics are actual distances (centimeters or 
meters) while the distances in his system are in j.n.d.’s, hence defined ac- 
cording to the organism’s reactions. The residual difference between Hull 
and Lewin on these points appears to be almost purely linguistic, for 
Lewin would not have objected to the substance of either of Hull’s two 
points. 

Success in Prediction 

Eventually a self-correcting hypothetico-deductive system must be 
judged by its successes, that is, the agreements between predictions and 
observed events, and by its adjustments to its failures. 

On any kind of tabulation, Hull’s several postulate systems have 
fared very well. 

A tabulation of the material in the rote-learning book showed evi- 
dence bearing on 71 of the 110 theorems and corollaries. The evidence 
gave clear support in 39 cases (55 per cent), was somewhat ambiguous 
in 20 cases (28 per cent), and disagreed with the prediction in the re- 
maining 12 cases (17 per cent). 05 

The successes represent a substantial achievement, but the failures, 
too, are achievements, in that they show Hull’s statements to have been 
made clearly enough that data could contradict them. 

93 Hull ( 1952a), page 242. 

94 Hull (1952a), page 269. 

95 Hilgard (1940). 
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Hull presented his own summary of successes in prediction in the 
final book. 96 He classified the evidence bearing on 121 of 178 theoretical 
propositions in that book. Of these, he found 106 (87 per cent) substan- 
tially validated, 14 (12 per cent) probably valid, but uncertain, and 
only one (1 per cent) definitely invalid. 

This very high agreement is somewhat disturbing at this stage in 
science, even though the ultimate aim is exceptionless validity. Too high 
agreement, when the system is as unfinished as this one, suggests either 
(1) low standards with respect to what constitutes agreement, or (2) too 
great reliance on working backward from data, with ad hoc assumptions 
to guarantee agreement. Guaranteed agreements should not be tabulated 
as verifications. Working backwards is not, in itself, a defect in a theo- 
retical system. In fact, there is no real alternative, for the improvement 
in a system comes precisely through "working backwards” to obtain 
better agreement with reality. But there are rules by which the deduc- 
tions remain coherent and internally consistent, and these are often 
difficult to apply. As Hull pointed out, the 55 predictions in his book not 
covered by known relevant evidence will provide a good test of the fer- 
tility of his system, because they are free of the charge of working 
backward from the known. 

ESTIMATE OF HULL’S POSITION 

HulFs Position on Typical Problems 

'Because he was willing to face the problems of learning posed by 
others as well as those which he set himself, it is possible to assign Hull 
a position on the representative problems chosen as the basis for compar- 
ing the different points of view. 

1. Capacity. A volume on individual differences on which Hull had 
been working was not completed by the time of his death. He published 
but a single paper on the problem 97 and did not carry the analysis much 
further in his final book. Individual differences in capacity will be re- 
flected in the constants that appear in the behavioral laws. Once these are 
under control, it will be possible to make fundamental attacks on such 
problems as that of the relationship between learning and persistent 
indiyidual differences. 

{2. Practice. Mere contiguous repetition does nothing but generate 
inhibition; all improvement depends upon reinforcement. Hull is in this 

9 «Huli (1952a), 351-353. 

* Hull (1945). 
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respect in agreement with Thorndike and Skinner, and opposed to Guth- 
rie. Because the amount of reinforcement does not affect habit strength, 
provided some minimum amount is present, the number of reinforce- 
ments is the basic variable in acquiring habit strength. 

3. Motivation . The basic paradigm for reinforcement lies in need 
reduction, as when food relieves the body's need for sustenance, or when 
escape saves the organism from injury. Thus the primary reinforcing 
quality of reward and of punishment is the same: food reward relieves 
hunger tension, escape from shock reduces shock tension. While these 
are the underlying biological facts, the reinforcement itself is mediated by 
stimulus reduction, that is, by reducing the stimuli associated with the 
drive, rather than by satisfying the drive itself. Secondary reinforcement 
can be provided by any stimulus regularly associated with primary rein- 
forcement. 

Drive is complexly related to learning: (1) it provides the basis 
for both primary and secondary reinforcement, (2) it activates habit 
strength into reaction potential, and (3) it provides differential internal 
stimuli that guide behavior. 

More complicated relationships involving anxiety, avoidance, ex- 
pectation, frustration, conflict, may be derived from the more primitive 
principles of primary and secondary reinforcement. Important inter- 
mediate derivations include adient (approach) behavior to perceived 
positive incentives and abient (withdrawal) behavior to perceived nega- 
tive incentives. 

4. Understanding. The organism's own responses furnish the surro- 
gates for ideas. The fractional antedating goal response (r G ) provides 
stimuli ( $ G ) whose sole function, in some instances, is to guide behavior. 
Responses that provide such stimuli are called “pure stimulus acts.” Ideas 
thus have the substantive quality that Guthrie also assigns to them. 
Two intermediate mechanisms emerge as important in meeting the prob- 
lems of behavior in space, and in problem-solving generally. These are 
the principle of the gradient of reinforcement, extended to include objects 
perceived at a distance, and the habit-family hierarchy, which permits the 
maximum utilization of past experience in the solving of present problems. 
Both principles depend for their effectiveness upon aroused fractional 
anticipatory responses, and upon discriminations among the stimuli that 
these responses produce. 

5. Transfer. There are two aspects to transfer: equivalence of stimuli 
and equivalence of responses. Hull explains equivalence of stimuli either 
on the basis of generalization, or via intermediate reactions. Equivalence 
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of responses depends in part upon response oscillation, and the generaliza- 
tion there involved; it depends also on the organization of responses into 
hierarchies by way of the habit-family hierarchy. All responses in the 
hierarchy have in the past led to the same goal, so that they are in that 
respect equivalent. 

6. Forgetting. In the volume on rote learning, the decay of excita- 
tion is postulated by Hull and his associates to occur according to a kind 
of law of disuse, making forgetting a function of time. He reaffirmed 
this position in one of the corollaries of the 1943 book, but did not pro- 
vide for it in his postulate set. 98 I find no reference to forgetting as such 
in the 1952 book, the only references being to the rise in the curve of 
reminiscence, without discussion of the subsequent fall. The only ac- 
quired function that decays systematically with the passage of time is 
reactive inhibition (In). 

It is pertinent to refer here to Hull's own summary of the basic 
principles as he perceived them, after completing the system reported 
in his final book. He listed and described eight automatic adaptive be- 
havior mechanisms. 

1. Inborn responses tendencies ( S U R ) provide the first automatic 
mechanism for adapting to emergency situations. 

2. The primitive capacity to learn is the second mechanism, “a 
slightly slower means of adaptation to less acute situations/ 7 

3. The antedating defense reaction, arising through learning com- 
bined with stimulus generalization, provides the third adaptive mech- 
anism. 

4. The extinction of useless acts, negative response learning, is the 
fourth mechanism. 

5. Trial-and-error learning is the fifth mechanism. 

6. Discrimination learning is the sixth mechanism. 

7. A second type of antedating defense reaction, depending upon 
the persistence of stimulus traces (rather than upon generalization, as in 
the case of a perceived dangerous object), is the seventh mechanism. 

8. The fractional antedating reaction (r G ) with its proprioceptive 

stimulus correlate ( s G ), provides for the “automatic (stimulus) guidance 
of organismic behavior to goals.” This eighth and final mechanism is the 
crowning achievement of the system. ; 

Further study of this major automatic device presumably will lead to the 
detailed behavioral understanding of thought and reasoning, which constitute 

98 Hull (1943), page 296. 
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the highest attainment of organic evolution. Indeed the r G » s Cr mechanism 

leads in a strictly logical manner into what was formerly regarded as the very 
heart of the psychic: interest, planning, foresight, foreknowledge, expectancy, 
purpose, and so on." 

The Status of HulVs Final Reinforcement Theory 

Hulls theory is not a highly integrated one, so that no one concept 
is truly central to it. If one accepts his own summary, according to the 
foregoing eight mechanisms, the fact of learning is important, but the 
particular mechanism of learning is not crucial. Perhaps the most crucial 
concept is that of the r G -$ G mechanism, which has only a tangential rela- 
tionship to reinforcement theory. Nevertheless, the most controversial 
issue among the S-R theories concerns the status of reinforcement. 

Hulls vacillation between the items to be reduced in primary rein- 
forcement led him to reject D, to turn to S D , then to add s G as an after- 
thought. There was no finality to his interpretation of primary reinforce- 
ment. We shall return to this issue again in Chapter 12, when we con- 
sider the views of some of his followers, and some recent experiments. 

He was also quite unclear as to what lay beneath secondary rein- 
forcement, although he was clear as to its importance. A stimulus as- 
sociated with primary reinforcement became at once (1) a conditioner of 
secondary drive, hence a producer of S D , and (2) a secondary reinforcer, 
and hence the equivalent of reduced S D . Hull was not really clear about 
this, because he seems not to have thought about both secondary drive 
and secondary reinforcement at the same time. His thinking was par- 
ticular, not general. When he thought of secondary drive, he thought of 
what Miller and Mowrer talk about as fear or anxiety. When he thought 
of secondary reinforcement, he thought of such things as distinctive end- 
boxes, or tokens substituting for food. But when he wrote his formal 
propositions, they implied that a stimulus associated with reinforcement 
could become at once both a drive and a reinforcing agent, for the con- 
ditions necessary to become a secondary drive appear to be the same as 
the conditions for becoming a secondary reinforcing agent (Corollaries 
i and ii). The literary form of the two corollaries differs somewhat, and it 
may be that Hull intended a difference, but if he did he was not clear. 

Hence these fundamental matters — the nature of primary and sec- 
ondary reinforcement — he really left for others to settle. 

99 Hull (1952a), page 350. 
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The Quantification of Reaction Potential 

Just what did Hull achieve in the effort to quantify reaction potential, 
a task to which he devoted much effort, and in the success of which he 
took some satisfaction? 

It is worth taking a look at what he did, because the problem of 
assigning quantities to intervening variables is one that other theorists 
who use such variables (e.g., Tolman) must eventually face. 



FIGURE 24. The logical problem of measuring intervening variables. 

A simplified version of Hull's system is presented in Figure 24. It 
omits generalization effects, K, V, and inhibition, but it includes the most 
important intervening variables S H R , D, and S E R , circled in the diagram 
to distinguish them from the input and output variables. The problem is 
to get from the external world of input and output to these intervening 
variables, in terms of which the most important equations are to be writ- 
ten. If we assume everything else constant, our fundamental equation 
is: 

S E R = / 3 ( S H R ,D) (1) 

As a first approximation Hull assumed the equation to be of the 
form: 

§E R = gH R X D (2) 

Rut while we are about the problem of strict quantification, we can- 
not make this assumption. It is part of our empirical-theoretical problem 
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to determine the form of the f 3 function. We cannot move from equation 
(1) to equation (2) without proof. 

Now, in general, we have three helpful contributions to determining 
the f functions, because each of the intervening variables is anchored to 
input or output. We have the three functions: 

s H r =zf 1 (N) , where N is the number of reinforcements (3) 

D = f 2 (h), where h is hours of food deprivation (4) 

s E r = f 4 ( S £ R ), where s f R is the latency of response (5) 

Were we able to determine f 19 f 2 , and / 4 , we should have some 
chance of determining f 3 . We can find equations in Hull’s system for 
each of these functions, so we might assume that he had followed out 
this program. Actually he did nothing of the kind. 

The only function indicated in Figure 24 that is actually given in the 
data is / a , which can be stated: 

8*B = /a(N) (6) 

That is, we know from experiments that in some instances latency 
of response becomes progressively shorter as the number of reinforce- 
ments increases, and we can fit curves to our results, or scale our results, 
if we wish. Hull and his collaborators proceeded to scale this function, 
and then named the scaled value S E K . That is, the f i function is deter- 
mined on the assumption that reaction potential bears a one-to-one cor- 
respondence to scaled latency. Then 

g E R = / 4 (s^r) (7) 

Now we are able to make an algebraic manipulation between equa- 
tions (6) and (7), substituting S E R for s f R in equation (6), and we ap- 
pear to have measured an intervening variable: 

sE B ==/b(N) (8) 

This is really the extent to which Hull, strictly speaking, quantified 
intervening variables. The f 3 function was never determined from inde- 
pendent computations of S H R , D, and S E R by way of the appropriate f 19 
f 2 , and f 4 functions. 

A further step was taken to complete the picture by using the data 
from Perm (1942), in which a functional relationship was established 
between hours of food deprivation and latency of response. Again by 
using the f 4 function shown in Figure 24, it was possible to state a re- 
lationship between hours of food deprivation ( h ) and reaction potential 
( S E R ). But this depended solely on the relation between ( h ) and ( s fe) 
and still did not solve the problem of the f l9 f, 9 and f z functions, which is 
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really the heart of the problem of assigning quantities to intervening vari- 
ables. Hull was rightly aware of the task that needed to be done, but he 
failed to accomplish it. 

The quantification task that was carried out so carefully — the scaling 
of latency — was not a total loss, for in the process some contributions 
were made to scaling theory; and the new position given behavioral 
oscillation in the system led to some interesting implications, as we saw 
in the example of predicted alternation sequences. 

How Satisfactory a System Did Hull Leave? 

Hull’s system has many points of superiority over any other existing 
psychological system. It is at once comprehensive and detailed, theoreti- 
cal, yet empirically quantitative. It is easy to locate faults within it be- 
cause it is so carefully worked out, so explicit and mathematical that 
its errors of incompleteness or inconsistency are easily brought into focus. 
A theory expressed solely in the ordinary literary language may sound 
very plausible because the gaps in the theory are glossed over through 
cogent illustrations. A theory such as Hull’s calls attention to itself when- 
ever it jumps a gap. In criticizing Hull’s theory we must not lose sight of 
the fact that, with all its weaknesses, it is a major achievement. 

We do well to think of Hull’s system as really twofold. On the one 
hand, he was embarked upon a bold and comprehensive theory of be- 
havior, a theory he hoped would serve as a basis for much of social sci- 
ence. On the other hand, he was experimenting with a very precise minia- 
ture system, with determinate constants based upon controlled experimen- 
tation. He attempted to combine these two enterprises at once, and was 
not very skilled at distinguishing between what he accomplished on a 
large scale and on a small scale, for he wanted the whole to be one sys- 
tem. 

On a large scale, when dealing with behavior in free space, problem- 
solving, and ideas, Hull made skillful use of peripheral mechanisms, 
particularly the hypothesized r G — s G sequence. By interlocking these 
goal anticipations into other features of his system, particularly through 
the gradient of reinforcement and the habit-family hierarchy, he was able 
to make large-scale deductions of familiar forms of behavior. When mov- 
ing on this large-scale level, the theory was very "molar” indeed, and al- 
most no efforts were made to pin down precisely the kinds of anticipatory 
movements (chewing movements, bodily postures) that would serve as 
the tangible base for the important fractional antedating response. The 
choice of this mechanism was made (as a surrogate for ideas) because it 
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was ponderable, and hence, in principle, at least, subject to measurement. 
One can easily think of possible experiments to test aspects of the r G 
mechanism, especially the grosser aspects. 100 If experiments cannot be 
designed to test the subtler r G mechanisms, then one might as well deal 
with “ideas” as with “pure-stimulus acts,” The argument that the charac- 
teristics of these acts can be inferred, if not measured, has some appeal, 
but it is possible to infer discriminations between ideas or engrams, too. 
Perhaps r G need have no more tangible reality than s Hu, but the system 
always implies that it is a genuine response, with proprioceptive conse- 
quences. It may be noted also that the large-scale deductions, built on 
r G , do not require any one special theory of learning, so long as goal 
anticipation can be achieved by that theory. 

When Hull was operating on a smaller scale, and attempting to be- 
come precise and quantitative, he became highly particularistic, confining 
many of the postulates and corollaries to the results of single experiments 
done with rat bar-pressing in a Skinner box modified so that a latency 
measure could be secured. Hull became so preoccupied with this quanti- 
fication that he failed to distinguish between this exercise in miniature- 
system construction and the larger task on which he was simultaneously 
engaged. He therefore began to write into his basic postulates what were 
really exercises in quantification. Surely a set of laws of mammalian 
behavior ought not reflect in the constants of its basic postulates such 
specialized information as the most favorable interval for human eyelid 
conditioning (Postulate 2), how many days without food weakens a rat 
(Postulate 5), the weight in grams of food needed to condition a rat 
(Postulate 7), a rats reaction time (Postulate 13), or the amplitude in 
millimeters of galvanometer deflection in human conditioning (Postulate 
15) . 

It would have been preferable to distinguish between the tentative 
generalization and then separately to offer the quantitative evidence as 
illustrative. An alternative would have been to limit the miniature system 
to the rat in the bar-pressing box, with every effort to be systematic and 
to make the various constants interchangeable, to make a determined 
effort to measure intervening variables within this one limited universe. 
Then the particularistic postulates would have been appropriate, as other 
postulates were appropriate to rote learning. 

The source of these deficiencies in the system cannot easily be speci- 
fied. It is difficult to distinguish between weaknesses inherent in the 
system, and problems that were Hull’s as a theorist and system-maker. 

100 An early example was gnen by N. E. Miller ( 1935). 
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We have two illustrations in his career of systems beginning with a 
large sweep and then, after heroic quantitative effort, ending so par- 
ticularistic as to be much less inviting than they were before the effort 
was made* The first was the rote-learning theory, which, in its grosser 
form 101 appeared more inviting than in its better worked-out form. 102 
So many ad hoc assumptions had to be made in the more precise formu- 
lation that little has been heard from this since. Now we find a similar 
progression in the theory of adaptive behavior. The early papers, in the 
1930’$, gave a new outlook upon the systematic prediction of behavior 
in a great variety of situations. The postulates were based largely upon 
the interpretation of maze behavior (simple trial and error, and serial 
learning). Now, as the system became more precise, these larger predic- 
tions were mostly carried over from the past; the new postulates in both 
1943 and 1952 became so tied to a limited kind of experiment (chiefly 
rat bar-pressing) that they are little likely to survive. It may be that 
particularism is an inevitable consequence of seeking precise quantita- 
tive prediction in the present stage of behavioral science. If so, it means 
that the theorist, at least for a time, will have to be modest in what he 
attempts to do, and may have to be satisfied to remain within a minia- 
ture system. If particularism is not the inevitable result of the scientific 
logic used, then it may be that something was wrong with the kind of 
concepts employed. The additional possibility is that some limitation 
in Hull as a systematist led him to particularism. Whatever the reason, 
Hulls followers will undoubtedly be able to build on his foundation to 
produce a more satisfactory postulate set than he left. 

It must be acknowledged that Hull's system, for its time, was the 
best there was — not necessarily the one nearest to psychological reality, 
not necessarily the one whose generalizations were the most likely to 
endure — but the one worked out in the greatest detail, with the most con- 
scientious effort to be quantitative throughout and at all points closely 
in touch with empirical tests. Furthermore, it may well be said to have 
been the most influential of the theories between 1930 and 1950, judging 
from the experimental and theoretical studies engendered by it, whether 
in its defense, its amendment, or its refutation. Its primary contribu- 
tion may turn out to lie not in its substance at all, but rather in the ideal 
it set for a genuinely systematic and quantitative psychological system 
far different from the “schools” which so long plagued psychology. 

101 Hull (1935a). 

102 Hull and others ( 1940). 



HULL’S SYSTEMATIC BEHAVIOR THEORY 


183 


SUPPLEMENTARY READINGS 


BOOKS 

Hull, C. L., Hovland, C. I., Ross, R. T., Hall, M., Perkins, D. T., and Fitch, 
F. B. (1940) Mathematico-deductive theory of rote learning. 

Hull, C. L. (1943) Principles of behavior. 

Hull, C. L. (1951) Essentials of behavior . 

Hull, C. L. (1952) A behavior system. 

SHORTER INTRODUCTIONS 

Hull, C. L. (1942) Conditioning: Outline of a systematic theory of learning. 
Chapter 2 in The psychology of learning. Natl. Soc. Stud. Educ., 41st Year- 
book, Part II, 61-95. 

Hull, C. L. (1950) Behavior postulates and corollaries — 1949. Psychol. Rev., 
57, 173-180. 

Lambert, W. W. (1954) Stimulus-response contiguity and reinforcement the- 
ory in social psychology. In G. Lindzey, ed. Handbook of social psychology. 
65-71. 

Spence, K. W. (1951) Theoretical interpretations of learning. In C. P. Stone, 
ed. Comparative psychology. Third edition. 239-291. 

Spence, K. W. (1951) Theoretical interpretations of learning. In S. S. Stevens, 
ed. Handbook of experimental psychology. 690-729. 

CRITICAL REVIEWS 

Hilgard, E. R. (1940) Review of Mathematico-deductive theory of rote learn- 
ing: The psychological system. Psychol Bull., 37, 808-815. 

Koch, S. (1944) Review of Hulls Principles of behavior . Psychol. Bull., 41, 
269-286. 

Leeper, R. (1944) Dr. Hull's Principles of behavior. J. genet. Psychol. , 65, 3—52. 

Leeper, R. (1952) Review of Hull's Essentials of behavior. Amer. ]. Psychol. , 
65, 478-491. 

Marhenke, P. ( 1940) Review of Mathematico-deductive theory of rote learning: 
The logical system. Psychol. Bull. , 37, 815-817. 

Ritchie, B. F. (1944) Hull's treatment of learning. Psychol. Bull. , 41, 640-652. 

Schoenfeld, W, N., and Bersh, P. J. (1952) Review of Hull's Essentials of be- 
havior. Psychol. Bull., 49, 628-636. 

Seward, J. P. (1954) Hull's system of behavior: an evaluation. Psychol. Rev., 
61, 145-159. 

Skinner, B. F. (1944) Review of Hull's Principles of behavior. Amer. }. Psychol. , 
57, 276-281. 

representative experiments 

Brown, J. S. (1948) Gradients of approach and avoidance responses and their 
relation to levels of motivation. J. comp, physiol. Psychol., 41, 450-465. 

Ellson, D. G. (1938) Quantitative studies of the interaction of simple habits. 
I. Recovery from specific and generalized effects of extinction. J. exp. 
Psychol, 23, 330-358. 

Grice, G. R. (1949) Visual discrimination learning with simultaneous and suc- 
cessive presentation of stimuli. J. comp, physiol. Psychol, 42, 365-373. 

Hull, C, L. (1935) The influence of caffeine and other factors on certain phe- 
nomena of rote learning. J. gen . Psychol, 13, 249-274. 



184 THEORIES OF LEARNING 

Hull, C. L. (1947) Reactively heterogeneous trial-and-error learning with dis- 
tributed trials and terminal reinforcement. J. exp. Psychol 37, 118—135. 
Kendler, H. H. (1946) The influence of simultaneous hunger and thirst drives 
upon the learning of two opposed spatial responses of the white rat. J. exp. 
Psychol , 36, 212-220. 

Kimble, G. A. (1949 ) An experimental test of a two-factor theory of inhibition. 
/. exp. Psychol. , 39, 15-23. 

Perin, C. T. (1942) Behavior potentiality as a joint function of the amount of 
training and the degree of hunger at the time of extinction. J. exp. Psychol ., 
30, 93-113. 

Spence, K. W. ( 1942) The basis of solution by chimpanzees of the intermediate 
size problem. J. exp. Psychol. , 31, 257-271. 

Spence, K. W., Bergmann, G., and Lippitt, R. A. (1950) A study of simple 
learning under irrelevant motivational-reward conditions. J. exp. Psychol., 
40,539-551. 

Woodbury, C. B. (1950) Double, triple, and quadruple repetition in the white 
rat. J. comp, physiol. Psychol., 43, 490-502. 

Yamaguchi, H. G. (1952) Gradients of drive stimulus (S D ) intensity generaliza- 
tion. J. exp. Psychol, 43, 298-304. 

FINAL NOTE 

If at some points the reader believes that I have been more severely critical 
of Hulls system than is objectively justified, he should perhaps be warned that at 
least one competent reviewer detected a bias against S-R theories in the earlier 
edition of this book: 

Miller, N. E. (1949) Review of Hilgard’s Theories of learning. Psychol. Bull. , 
46, 529-532. 

For a detailed and searching criticism of Hull's system from the standpoint 
of the logic of science, see: 

Koch, S. (1954) Clark L. Hull. In W. K. Estes and others. Modern learning the- 
ory. 1-176. 

To achieve a balanced view of Hull's earnestness, and his modesty about his 
attainments, his autobiography is well worth reading: 

Hull, C. L. (1952) Autobiography. In H. S. Langfeld and others. A history of 
psychology in autobiography. Volume 4. 143-162. 



Chapter 6 

TOLMAN’S SIGN LEARNING 


T HE theory of Edward C. Tolman was called purposive behaviorism 
in his major systematic work, Purposive behavior in animals and 
men (1932). Since then he (and others) have called it a sign-gestalt 
theory, a sign-significate theory, or an expectancy theory. These later 
terms all emphasize the cognitive nature of the theory, which distin- 
guishes it in certain respects from the stimulus-response theories of Thorn- 
dike, Guthrie, Skinner, and Hull. The designation sign learning provides 
a satisfactory short name, abbreviating sign-gestalt and sign-significate, 
while calling attention to the cognitive reference within the theory. 

Tolman acknowledged the complex affiliations of his system — with 
Watson's behaviorism, McDougall’s hormic psychology, Woodworths 
dynamic psychology, gestalt psychology (in both classical and Lewin’s 
forms). He later recognized a number of parallels between his system and 
the probabilistic functionalism developed independently by Brunswik. 1 
Still later, the influence of psychoanalysis began to be noticeable. 2 Be- 
cause of these complex affiliations it is appropriate to consider his sys- 
tem after our familiarity with the other behaviorisms, and before turn- 
ing to the non-behavioristic theories. 

Despite some shifts in vocabulary, Tolman has held firm to his main 
tenets during the more than twenty years since his book appeared: 

1. His system is a genuine b ehavio rism , and as such rigidly rejects 
introspection as a method and 4 raw feels” as data for psychological sci- 
ence. When he makes reference to consciousness, to inventive ideation, 
and the like, he is talking about interpretations of observed behavior. 
He does not accept ‘Verbal report” as a dodge by which to smuggle 
consciousness in through the back door. 

2. The system is a mohr , rather than a molecular , behaviorism. An 

1 Tolman and Brunswik ( 1935 ) . 

2 Tolman (1942) (1943). 


185 



186 THEORIES OF LEARNING 

act of behavior has distinctive properties all its own, to be identified and 
described irrespective of whatever muscular, glandular, or neural proc- 
esses underlie it The molecular facts of physics and physiology upon 
which behavior rests have identifying properties of their own, which are 
not the properties of behavior as molar. This means for Tolman an in- 
dependence from physiology, a characteristic which he shares with 
several of the writers whom we have considered. 

3. The system is a purposivwn , but of a sort to avoid the implica- 
tions of a teleological metaphysics. It is a purposivism because it recog- 
nizes that behavior is regulated in accordance with objectively determi- 
nable ends. It is not mentalistic; purposes are not those of a self-conscious 
mind. It is not in agreement with teleological points of view which make 
effects take precedence over and determine their causes. 

The strongest rejection is of American structuralism, because struc- 
turalism was dependent upon introspection of the most offensive sort, 
that known as Beschreibang . Watsonian behaviorism is almost as vigor- 
ously rejected, because it was not only molecular but tended to neglect 
the problems of goal-seeking behavior. 

THE SYSTEMATIC POSITION 
Behavior as Molar 

The descriptive properties of molar behavior are the most general 
characteristics of behavior which would impress themselves upon an in- 
telligent onlooker without prepossessions and before any attempt to ex- 
plain how the behavior comes about 

; JFirst, behavior is goabdjjeeted. It is always, a getting-toward some- 
thing, or a getting-away from something. The most significant descrip- 
tion of any behavior is what the organism is doing, what it is up to, where 
it is going. The cat is trying to get out of the box, the carpenter is build- 
ing a house (or earning a living), the musician is seeking acclaim for his 
virtuosity. The particular movements involved are less descriptive of 
the molar behavior than is the goal toward which or away from which 
the movements lead. This feature characterizes molar behavior as pur- 
posive. 

Second, the behavior makes use of environmental supports as 
means-objects toward the goal. The world in which behavior goes on is 
a world of paths and tools, obstacles, and by-paths with which the or- 
ganism has commerce. The manner in which the organism makes use of 
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paths and tools in relation to its goals characterizes molar behavior as 
cognitive as well as purposive. 

Third, there is a selective preference for short or easy means-activities 
as against long or difficult ones, called the principle of least effort. 

- Fourth, behavior, if it is molar, is docile. That is, molar behavior is 
characterized by teachableness. If it is mechanical and stereotyped, like 
a spinal reflex, it belongs at the molecular level. Docility is said to be a 
mark of purpose. 3 

Intervening Variables 

The complete act of behavior is initiated by environmental stimuli 
and physiological states. Certain processes intervene, and behavior 
emerges. Programmatically, this is the formula which Hull took over 
from Tolman. The problem of psychological analysis at the molar level is 
to infer the processes which intervene between the initiation of action in 
the world of physics and physiology and the resulting observable conse- 
quences, again in the world of physics and physiology. Because all of 
the data are rooted in this world, the system remains a behaviorism. 

In spite of his methodological behaviorism, Tolman is clearly bent 
on making a ‘psychological” as against a “physiological” analysis. The 
intervening variables include such processes as cognitions and purposes, 
so that, on one side of its ancestry, Tolman’s position belongs with the 
gestalt psychologists who have been characterized as “centralists,” rather 
than with the stimulus-response psychologists characterized as “periph- 
eralists.” 4 

The precise variables entering into behavior determination have not 
remained fixed in Tolmans later discussions, but his logic of system- 
making has remained the same. The set of terms used in his presidential 
address before the American Psychological Association in 1937 & may 
serve as illustrative. 

The background of physiology and physics with which choice-point 
behavior begins is defined by environmental and individual difference 
variables: 

I, Environmental Variables 

M — Maintenance schedule 
G — Appropriateness of goal object 
S — Types and modes of stimuli provided 

3 In this Tolman follows Perry { 1918 ). 

4 Murray ( 1938 ) ; Leeper ( 1944 ) . 

s Tolman (1938a). 
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R — Types of motor response required 
3 (OBO) — Cumulative nature and number of trials 6 
P — Pattern of preceding and succeeding maze units 

II. Individual Difference Variables 

H — Heredity 
A — Age 

T — Previous training 

E — Special endocrine, drug or vitamin conditions 

It is possible to study the effect of such variables on resulting be- 
havior. The usual learning curve is a plot of the functional relationship 
under stated conditions. These are the behavioral “facts” about learning. 
It is the effort to explain the facts which leads to theories. 

Tolman's explanation rests on intervening variables. These are in- 
ferred processes between the independent variables (stimuli/ etc.) and 
the dependent variables (responses, etc.). The preliminary list as pre- 
sented co-ordinates one intervening variable with each of the environ- 
mental variables. 


Intervening Variable 

Demand correlated with 

Appetite 

Differentiation 

Motor skill 

Hypotheses 

Biases 


Environmental Variable 
Maintenance schedule 
Appropriateness of goal object 
Types and modes of stimuli provided 
Types of motor response required 
Cumulative nature and number of trials 
Pattern of preceding and succeeding 
maze units 


Although the intervening variables may sound subjective, in princi- 
ple each can be given objective definition and measurement through a 
defining experiment in which everything else is held constant except the 
correlative environmental variable while that one is systematically varied. 
Demand, for example, may be expected to increase with the number of 
hours since feeding, but the relationship between food deprivation and 
demand is not a simple one. It must be studied empirically. The same 
holds for each of the intervening variables. 

' Having thus established a basis for inferring the value of the inter- 
vening variable from the antecedent conditions, the next stage in theory 
construction is to find the equations relating intervening variables to 
behavioral outcomes, as these intervening variables are simultaneously 
varied. 


ft 2(OBO) is a shorthand formula which means some consequence or summation of 
previous experiences in which one occasion (O) has led through behavior (B) to an- 
other occasion (O). The occasions are such features as a choice point, a goal at the 
left, and so on. 
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In spite of the clear outline of what a systematic theory ought to be, 
Tolman has nowhere attempted quantitative predictions paralleling those 
of Hull, so that his conjectures have not in that sense been put to the 
test. 7 This does not mean that his experiments are unrelated to his theory. 
There are, in fact, many predictions, but they assert that one path will 
be preferred to another, that under one set of circumstances the problem 
will be easier than under another set, and so on. The dimensional analy- 
sis which completes the function is not provided, and Hull's conscientious 
efforts are instructive in showing how difficult that task is likely to be. 

Recent Formulations in Relation to Earlier Ones 

There have been no radical revisions in Tolman’s systematic think- 
ing, although various schemes have been tried out for classifying the 
phenomena of learning, and a number of diagrams have been prepared 
to provide models for thinking about learning and other aspects of be- 
havior. The confident insistence upon building all principles so as to be 
derivable from and applicable to rat behavior (possibly to guarantee that 
the system remained a behaviorism) seems to have been somewhat re- 
laxed. Some of the later papers attempt to generalize the interpretation 
of motivation and cognition in a way useful to social science generally. 
In these later papers more illustrations come from child-rearing and hu- 
man behavior than in the earlier writings. 

The most elaborate model since the first book holds to the original 
pattern of independent variables, intervening variables, and dependent 
(action) variables. 8 The changes from earlier to later models do not re- 
flect any fundamental reorientation, although there are some vocabulary 
differences. The three chief intervening-variables in the 1951 model are 
(a) the need system, ( b ) the belief -value matrix, and (c) the behavior 
space. 

The need system bears a family resemblance to the drive theories 
with which we are already familiar. The needs arise either through 
physiological deprivation or through psychologically defined drive con- 
ditions. 

The behavior space in which 'locomotion” takes place is a space of 
objects in various places, with distance and direction, as perceived by the 
actor at a given moment. These objects have positive or negative 
"valences” on them, and the actor is attracted or repelled by them. The 
behavior space is very similar to Lewin s life space. We shall conside** 

7 Some starts have been made, e.g., Tolman ( 1939 ) ( 1941 ). 

s Tolman (1951b) (1952b). 
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this kind of construct when Lewin's views are discussed in Chapter 8. 

If we have become accustomed to stimulus-response language, we 
are likely to be confused initially by some of the language the cognitive 
theorist uses, especially his physical or geometrical metaphors. When 
the stimulus-response psychologist refers to an object in space, he means 
that physical object, or at least those aspects of it which, as stimuli, can 
impinge upon the subject's receptors. When the cognitive theorist refers 
to an object in space, he is commonly talking about that object as per- 
ceived , or as it is expected to be when perceived. It is important to re- 
member that perception and perception-like processes are central for him. 
Thus when Tolman assigns valence to an object he does not mean that 
some mysterious attracting or repelling force has been added to a physi- 
cal object. He means only that the object as perceived or as expected is 
as an object-to-be-sought or as an object-to-be-avoided. We may recall 
William James' description of a nestful of eggs as perceived by a broody 
hen: “the utterly fascinating and precious and never-to-be-too-much-s at- 
upon object” For the hen, the nestful of eggs has positive valence. 

The belief-value matrix holds most interest as a possible supple- 
mentation to learning theory. This matrix (or, really a set of matrices) 
includes learned categorizations and differentiations concerning environ- 
mental objects. Objects categorized in a common class have gradients of 
generalization built up, so that the objects come to have a preferential 
or hierarchical order, similar in effect to Hull's gradients of generaliza- 
tion or his habit-family hierarchy. Tolman points out how little infor- 
mation we have about any of these gradients, except for simple sensory 
qualities. What he has in mind is something more complex. He uses as 
an illustration the functional dimension described by Sears and his co- 
workers for fantasy aggression in the doll-play of children. 9 These in- 
vestigators found that doll-figures showed a consistent order of “psycho- 
logical distance” from the child, as revealed by the amount of aggression 
either attributed to them, or addressed to them. The order — from near 
to far— is the child itself, then the parent doll of the same sex, the parent 
doll of the opposite sex, the child doll of the same sex, the child doll of 
the opposite sex, then the baby doll, then puppets or- imaginary forces. 
Tolman assumes that such a gradient would be learned in our culture 
and would constitute then one item in the belief-value matrix. 

To carry further with this system, Tolman points out that it would 
be necessary to provide operational definitions for all the following terms: 

0 E,g.» Sears { 1951 ). 
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VARIABLES TO BE OPERATIONALLY DEFINED 10 

Need Systems 

a. List of needs 

b. Magnitude of a given need at a given moment 

Belief-Value Matrices 

c. The magnitude of the gratification and deprivation ‘Values” of the 
given matrix at a given time. 

d. The shape of the cathexis belief attaching the various types of goal 
object (arrayed along a given generalized dimension) to the gratifi- 
cation end of a given matrix. 11 

e . The shape of the means-end belief attaching types of means object 
(arrayed along a given generalization dimension) to a given type of 
goal object or subgoal object. 

Behavior Spaces 

f. Perceived qualities, distances, and directions in the behavior space. 

g. The strength of a need-push. 

h. The strength of a positive or negative valence. 

i. The strength and direction of a field force. 

/. The identification of a locomotion. 

It is evident enough from the unfinished business implied in this 
list that, whatever new directions are proposed for the model, the new 
model is not yet ready for detailed critical appraisal. 

SIGN LEARNING 

Sign Learning as an Alternative to Response Learning 

Stimulus-response theories, while stated with different degrees of 
sophistication, imply that the organism is goaded along a path by inter- 
nal and external stimuli, learning the correct movement sequences so that 
they are released under appropriate conditions of drive and environ- 
mental stimulation. The alternative possibility is that the learner is fol- 
lowing signs to a goal, is learning his way about, is following a sort of 
map — in other words, is learning not movements but me anings. This is 
the. contention .of Tolman’s theory of sign learning. The organism learns 
sign-significate relations; it learns a behavior route, not a movement pat- 
tern. Many learning situations do not permit a clear distinction between 

10 The list is reproduced verbatim from Tolman ( 1951b), page 334. 

11 A cathexis belief is a preference or selective appetite whereby one goal object is 
valued above another as a means of obtaining gratification. The geometrical concept 
(“shape of cathexis belief”) refers to some sort of scaling of a series of possible goal- 
objects (e.g., foods) on a preference dimension, some of them being closer together 
some farther apart. (Tolman, 1951b, pages 337-338.) 
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these two possibilities. If there is a single path with food at the end and 
the organism runs faster at each opportunity, there is no way of telling 
whether his responses are being stamped in by reinforcement or whether 
he is guided by his immanent purposes and cognitions. 

Because both stimulus-response and sign learning so often predict 
the same behavioral outcome, it is necessary to design special experiments 
in which it is possible to favor one theory over the other. Three situa- 
tions give strong support to the sign-learning alternative. These are ex- 
periments on reward expectancy, on place learning, and on latent learn- 
ing. 

1. Rew ard expectanc y. One of the earliest and most striking observa- 
tions on reward expectancy was that of Tinklepaugh (1928). In’ his ex- 
periment, food w as placed under one of two containers while the monkey 
was looking but prevented from immediate access to the cans and food. 
Later the monkey was permitted to choose between the containers and 
showed skill in choosing correctly. The behavior which is pertinent here 
occurred when, after a banana had been hidden under one of the cups, 
the experimenter substituted for it a lettuce leaf (a less preferred food). 
The monkey rejected the lettuce leaf and engaged in definite searching 
behavior. Somewhat the same sort of behavior was found by Elliott 
(1928) when the food in the goal-box of a rat maze experiments^ 

''changed from'bian mash to sunflower seed. More systematic experiments 
have been carried out since with ^c himpanzees. 12 There is little doubt 
that animals have some sort of precognition or expectancy of specific 
goal objects. Under those circumstances, other goal-objects produce signs 
of behavior disruption. Such behavior means that the sign-learning theory 
is appropriate; it does not, of course, mean that other theories may not 
attempt to deduce the behavior from other principles. 

2. Place learning. Experiments on place learning are designed to 
show that the learner is not moving from start to goal according to a 
fixed sequence of movements, such as would be predicted from reinforce- 
ment theories, but is capable of behavior which is varied appropriately to 
changed conditions, a s thoug h he “knows” where the goal is. There are 
three subtypes of these experiments. 

The first subtype of the place-learning alternative to response learn- 
ing leaves the form of the path intact but interferes with the movement 
sequences in getting from start to goal. In one experiment, rats that had 
learnep to run a maze straight on, after surgically produced cerebellar 
^•Cowles and Nissen (1937). 



TOLMAN’S SIGN LEARNING 193 

damage were unable to run the maze except in circles, but even so were 
able to run without error. 13 They could not have been running off the 
earlier learned sequences of kinesthetic habits. In another, rats were able 
to demonstrate what they had learned by swimming the correct path 
after having been trained in wading it. 14 

The second subtype of place-learning experiment sets a movement 
habit against a spatial habit and determines which is the more readily 
learned. Tolman and his collaborators 13 arranged an elevated maze in 
the form of a cross, as shown in Figure 25. The response-learning group 
was started in random alternation from either Sj or S 2 , always finding 
food by turning to the right. That is, food was at F t when the start was 



FIGURE 25. Maze used to test the relative ease of learning either a response 
which brings reward or the place at which reward is found. By starting irregularly 
at Si and S 2 , but finding reward at the same food box each time, one group of rats 
turns now to the right, now to the left, to find food always at the same location. These 
are the place learners. By starting at either Si or S 2 , but always finding food as a 
result of turning the same way (right or left), another group is taught always to make 
the same response, but to find food at diffeient places, depending upon the starting 
point. Place learning, under the conditions of the experiment, is found to be easier 
than response learning. From Tohiif ■ R ic T V. 1 (1946), page 223. 

Sj and at F 2 when the start was S 2 . The place-learning group, by con- 
trast, always went to the same place for food. This meant that if running 
to F 1? a right turn would be required when starting from S x and a left 
turn when starting from So.fThe place-learning group was much the more 
successful. The eight rats of the place-learning group all learned within 
8 trials, so that the next 10 trials were without error. None of the eight 
rats of the response-learning group learned this quickly, and five of them 
did not reach the criterion in 72 trials. gCJnder the circumstances of an 

18 Lashlewand Ball ( 1929). 

14 MgufefrCin^ £1930). 

Ritchie, and Kali^h (1946) (1947). 



194 THEORIES OF LEARNING 

elevated maze with many extra-maze cues, it is clearly demonstrated 
that place learning is simpler than response learning. 16 

The third subtype of place-learning experiment involves the use of 
alternative paths when a practiced path is blocked. An early form of the 
blocked-path experiment is that of Tolman and Honzik ( 1932a) which is 
said to demonstrate inferential expectation, or insight. The main features 
of the arrangement are as follows. There are three paths (1, 2, and 3), 



FIGURE 26. Maze used to test insight in rats. The paths become established as a 
hierarchy according to length. Path 1 preferred to Path 2, Path 2 to Path 3. If Path 1 
is closed by Block A, the rats run by Path 2. If Path 1 is closed by Block B, the rats 
run by Path 3 if they have " ‘insight” that the barrier closes Path 2 as well as Path 1. 
From Honzik and Tolman (1930a), page 223. 

in that order of length from shortest to longest, and hence in that order 
of preference (Figure 26). In preliminary training, when path 1 was v 
blocked, a preference was established between path s 2 and 3 fo r the 
shorter of these paths. Only when path 2 was blqcked also, did they 
run by path 3. We may somewhat oversimplify by saying ^that a familiarity 
with all paths and a preference for them in the order. 1, 2, 3 was estab- 

16 The importance of the extra-maze cues has been demonstrated by Blodgett and 
McCutchan (1947) (1948), who have other criticisms of the experiment cited. See 
the experimental reply, Ritchie, Aeschliman, and Peirce (1950). For other criticisms 
of space-learning experiments see Gentry, Brown, and Kaplan (1947). 
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Iished in preliminary training. An important feature of the maze design, 
crucial for the test, was that paths 1 and 2 hgd a common segment lead- 
ing to the goal. Previously the block in path 1 had been placed, before 
this common path; then the rat, after backing up from the block, ran to 
path 2. Now in the test the block was placed farther along path 1, so that 
it fell in the common path. Would the rat in backing out again run by 
the second preference, path 2, and be frustrated, or would it “see” that 
path 2 was also blocked? What the rats did, predominantly, was to avoid 
path 2, and to take the path ordinarily least preferred, the long path 3, 
but the only one open. Again the hypothesis is supported that the rat 
acted in accordance with some sort of “map” of the situation, and not ac- 
cording to blind habit^ or according to the automatic running off of habits 
in hierarchical order. 17 " 

^One of the several later experiments using blocked paths also in- 
volved rotation of the starting table through 180° on test trials, as illus- 
trated in Figure 27. 1S The original T-maze used in training is diagrammed 
at the left. From the table-top A the F x rats learned to turn left, in the di- 
rection of the light L r They were rewarded by food at the end of the run, 
in the place signallized by the light. The F 2 rats learned to turn right, in 
the direction of the light L 2 . The figure on the right shows how the table 
was moved from the test trials. It was not only moved, but provided with 
10 radiating paths, in addition to the original straight-ahead path, now 
blocked. You would predict from a naive place-theory that the rats, know- 
ing their way around, would take that path leading most directly toward 
the light where they had been fed. With rare exceptions they failed to 
choose the direct path. They did, however, show a direction reversal, the 
F t rats now predominantly turning right rather than left, the F 3 rats 
turning left rather than right. They responded to the rotation of the 
starting point all right, but they tended to choose the extreme paths 
running most directly toward the appropriate wall rather than directly 
to the place where they had been fed. While the results thus lend some 
support to a directional-orientation theory, they show that a rafs “cogni- 
tive map” (if it exists) does not correspond in any simple or direct man- 
ner to the rat s experienced spatial reality. 

3. Latent Learning . In addition to the experiments on reward ex- 

V r ~ ' “’"*“"** 

17 The Tolman and Honzik experiment has been criticized by other experimenters 
who have shown that the results, while reproducible, are easily disturbed by the 
manipulation of experimental variables such as alley width which would, fine be ex- 
pected to make insight impossible. See Evans (1936), Harsh (1937), Ke’ and Hill 
(1936), Kuo (1937). 

« Ritchie (1948). 
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pectancy and on space learning, a third variety of experiment bears im- 
portantly on sign learning. These are the latent-learning experiments 
to be described more fully later in this chapter. The latent-learning ex- 
periments show that an animal can learn by exploring the maze, with- 
out food reward, so that, when reward is later introduced, performance 
is better than that of rats without this exposure, and sometimes as good 
as that of rats with many previously rewarded trials. The “latent learn- 
ing” consists of knowledge of the maze, not revealed in choice of the 
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FIGURE 27. Place orientation in the rat. The dotted figure on the right shows 
the T-maze used in training, the solid figure the rotated maze, with radiating paths, 
as used in testing. In testing, the direct path from the table-top to the original choice- 
point was blocked. After fUtchie (1948), page 664. 


shortest path from entrance to exit until the rat is motivated to make 
that choice. The experiments, beginning with those of Blodgett (1929), 
have been interpreted as critical of reinforcement explanations of learn- 
ing, although, as we have seen, the issue has been somewhat dulled by 
changes in Hulls interpretation of reinforcement. 

In his own summarization of the experimental work supporting his 
theory, Tolman (1948b) lists two additional varieties as significant: the 
experiments on “hypotheses” in rats, X9 and the experiments on vicarious 
trial and error (VTE).*° The experiments on hypotheses led to the con- 
clusfc^Nfhat in a four-choice maze the animals adopted systematic modes 
H\er sky (1932u). 

- h Muenzinger (1938), Tolman (*1939), Geier, Levin, and Tolman (1941), Tackson 
(1943). J 
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of solution, such a choice being somewhat tentative, so that one mode 
would be rejected for another. These experiments furnished the back- 
ground for a controversy over “continuity” and “discontinuity” in dis- 
crimination learning, the hypothesis experiments furnishing support for 
the “discontinuity” interpretation. 21 Vicarious trial and error refers to 
the vacillation at a choice-point before the animal is “committed” to one 
or the other choice. This active comparing of stimuli appears to Tolman 
to support his viewpoint that perceptual or cognitive processes are promi- 
nent in controlling behavior at a choice point. 

Expectancy verms Habit 

Can a more precise formulation be made of the sign-learning theory 
— the theory to which the results of all of these experiments are said to 
conform? It has sometimes been characterized as a theory of “what leads 
to wha-tr a theory of signs, significates, and behavior routes. There are 
really two centra^ problems in a theory of this kind: first, how is the 
expectation acquired that a given sign will lead to its significate, and, 
second, how is this expectation translated into action? We begin with 
the problem of acquiring the expectation. 

The theory is that with repeateci experience of a sequence of events 
leading to a goal the probability is discriminated that the given behavior 
will eventuate in the expected end result The result (for behavior) is 
faster running and blind-alley elimination, exactly as if a habit were 
being strengthened by reward. Hence some ingenuity is required to iso- 
late circumstances in which the expectancy interpretation is more ap- 
propriate than the habit-strengthening one. While the foregoing account 
of experiments gives qualitative support to a cognitive interpretation of 
learning, if expectancies conform to probabilities, a more quantitative 
approach should be possible. In a very stimulating paper Tolman and 
Brunsui k J 1935) proposed that the causal texture of the environment is 
not such as to permit firm expectations but that predictions must often 
be made on the basis of probabilities. In an experimental test of the 
proposal, Brunswik (1939) presented his rats with such a contingent 
environment, in which food might sometimes be found on the right, some- 
times on the left. He found that there was some measure of agreement 
between the choices of the rat and the probability that food would be 
where he went for it. 

An important supplementary finding in Brunswiks experiment is 
that the nature of the goal provides an emphasis that makes a great dif- 

21 For a further discussion of this issue, see Chapter 12. 
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ference in the behavioral response to.. the .discriminated probability. He 
found, for example, that rats rewarded on one side of a T-maze 1 00 per 
cent of their runs to that side, and on the other side in 50 per cent of 
their runs to that side, showed a preference for/tiie two sides in roughly 
the 2:1 ratio of these reward schedules. This preference was tested dur- 
ing the last eight trials of training. During these trials, the rats ran to the 
100 per cent rewarded side in 72 per cent of the runs, and to the 50 per 
cent rewarded side in 28 per cent of the runs. If, however, the partial 
reward side was also “dangerous” on half the trials (a shock being used 
instead of non-reward), this cancelled the effect of the intermittent re- 
ward, and learning was equivalent to that in a conventionaLT-maze 
experiment in which one side only was rewarded. This, and related re- 
sults, led Brunswik to state: 

Discrimination of probabilities tends to increase with the rate of the proba- 
bility of emphasis on the two sides of the probability discrimination problem. 22 

Common experience amply justifies this supplementation to a sim- 
ple probability theory. One might walk into a store to try to make an 
unusual purchase even though the probability was only 1:10 that the 
store carried the merchandise sought. But one would not buy a ticket 
on an airline where the probability of a crash was 1:10. Discriminated 
probabilities are weighted before they emerge in action. 

In order to compare an expectancy theory with a reinforcement 
theory, Humphreys performed several experiments of the conditioned- 
response type. In human eyelid conditioning, it had already been shown 
that conditioned discrimination was more rapid when subjects knew 
which stimulus of a pair was to be positive, which negative, than when 
the probabilities had to be established through experience with the 
stimuli. 23 The conjecture that the subjects were responding according 
to their expectations was a plausible one. Humphreys went on to show 
that random alternation of reinforcement and non-reinforcement led' not 
only to as much conditioning as reinforcement every trial, but was fol- 
lowed also by greater resistance to extinction. 24 During extinction, re- 
sponses increased in frequency at first, and then fell off. This'would be 
anticipated on an expectancy theory because the likelihood of reinforce- 
ment was great after a non-reinforcement during the body of the ex- 
periment, and greater still after two non-reinforcements because there 
were never more than two successive non-reinforcements during the 

22 Bmnswik (1939), page 185. 

22 Hilgard, R. K. Campbell, and W. N. Sears (1938). 

24 Humphreys (1939a). 
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training sessions. But even after this high point, extinction was gradual 
Humphreys conjectured that a shift from intermittent reinforcement to 
uniform non-reinforcement must have led with difficulty to the hypothe- 
sis of uniform non-reinforcement. Because the experiment was done with 
human subjects, a direct test was possible on the verbal level, and 
Humphreys designed and carried out a simple experiment which con- 
firmed his conjecture. 23 

For the study of verbal expectations, two lights were arranged on 
a board. When one of these lights was turned on, the subject was asked 
to guess whether or not it would be followed by the other light. Half 
the subjects were “trained” with the second light invariably following 
the first. They came gradually to guess in a high percentage of the cases 
that the first light would be followed by the second, in agreement with 
their experience. The other half had the second light turned on only in 
random alternation, so that half the time it did not appear. They guessed 
at chance level. Within the conditioning experiment, the intermittent- 
reinforcement group performed like the group uniformly reinforced; but 
in the verbal experiment, the groups behaved entirely unlike. Does this 
difference not invalidate the comparison? On the contrary, it is an essen- 
tial part of it, because it is necessary to distinguish between performance 
and expectation. In the conditioning experiment, uncertainty leads to 
conditioned responses as well as certainty because it is a “danger” situa- 
tion ; blinking is as easy as refraining from blinking, and there is no 
penalty for an “erroneous” response, that is, for blinking to the condi- 
tioned stimulus alone. In the verbal experiment, response is more nearly 
representative of “pure” expectation, because a false guess is subjectively 
interpreted as a mistake in a way in which a false conditioned response 
is not. 

The crucial portion of the experiment is that in which extinction is 
simulated, that is, in which the first light is never again follow ed by the 
second. Humphreys’ results are plotted in Figure 28. It is seen that file 
group which h ad been train e d on the every-trial “reinforcement” quickly 
developed the hypot hesis o fjmiform n on-reinforcement, and ceas ed to 
ex pect t he second light. The group trained with intermittent “reinforce- 
ment” showed the rise in expectation whic h the theory demands, and 
the slow acceptance of the hypothesis that there would fee no more sec- 
ond lights. 36 

25 Humphreys (1939b). 

26 The results of intermittent reinforcement, both within conditioning experiments 
and within verbal experiments, have proved to be of considerable theoretical interest. 
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A provisional expectancy is an hypothesis. When a situation is not 
yet structured so that path-goal relationships become clear, behavior in 
relationship to the situation may either be “random” or “systematic.” If 
it is systematic, it may be said to accord with an hypothesis. Score 
averages from groups of animals tend to conceal the differences between 
chance approaches and more orderly approaches, because one animal’s 
responses cancel another’s unless they happen to correspond to the 



"EXTINCTION" TRIALS 


FIGURE 28, A phenomenon like extinction in an experiment on verbal expecta- 
tions, Prior to yielding the data in this figure, one group had regularly experienced 
a second light following a signal light, and had developed the expectation that a 
second ligh \\ :.Y .* .. - fc’Yw. This condition simulates 100 per cent reinforce- 
ment in a : »■ For the second group, the signal light had been 

followed by tfce 'expected light only half the time. This condition is like 50 per cent 
reinforcement in conditioning, and leads to uncertainty as to whether or not the signal 
will be followed by a second light. Beginning with the trials shown in the figure, the 
signal light was never followed by the second light, thus simulating extinction. What 
is recorded is the per cent of subjects guessing on each trial that the second light 
would occur. Following regular reinforcement the change to the expectation of regu- 
lar non-reinforcement occurs more readily than following intermittent reinforcement. 
After Humphreys (1939b). 

experimenter's plan as to what will be counted correct. By studying the 
presolution behavior of individual animals, systematic attempts at solu- 
tion can often be discovered, solutions of the order: “turn right/’ “alter- 
nate,” “go to the dark.” These are not mere fixed preferences, because 
after one pattern has been used for a while, the animal learner may shift 

and the experimental literature is now sizable. For a useful summary, see Jenkins and 
Stanley (1950), On verbal experiments similar to Humphreys", see Grant, Hake, and 
Horaseth (1951), Hake, Grant and Hornseth (1951), 
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to another one. Krech named such systematic attempts* at solution “hy- 
potheses,” and studied them under a number of conditions. 27 

The full-fledged conception of expectancy thus includes at least the 
following aspects: (1) The organism brings to a problematic situation 
various systematic modes of attack, based largely on prior experiences. 
(2) The cognitive field is provisionally organized according to the 
hypotheses of the learner, the hypotheses which survive being those which 
best correspond with reality, that is, with the jcausal texture of the en- 
vironment. These hypotheses or expectancies are confirmed by successes 
* n gpsl^c^vement. (3) A clearly established cognitive structure is 
available for use under altered conditions, as when a frequently used 
path is blocked. 28 

Confirmation verms Reinforcement 

Because Tolman has been at such pains to protest against the law of 
effect and the principle of reinforcement as essential to learning, it is in 
order to inquire what role he assigns motivation in learning. 

One role assigned to motivation is in relation to performance. Learn- 
ing is used when drives are active. When drives are aroused, a state of 
tension ensues, leading to demands for goal-objects. 29 These tensions 
lead to activity, guided by the expectancies “or cognitive structures avail- 
able. In this role, then, motivation is not really a factor in learning at 
all. It is related to performance, not to acquisition. Because this role of 
motivation is made much of in the criticisms of reinforcement theories, 
the impression is given that learning, the acquiring of cognitive struc- 
tures, is independent of motivation. This is not true in fact, nor is it true 
in Tolman s theory. 

The second role of motivation is in the acquisition of cognitive 
structures. In one of the experiments critical of the law of effect, the 
suggestion was made that what appeared to be the principle of effect 
might better be considered the influence of emphasis. 30 That is, motiva- 
tion determines which features of the environment shall interest the 
learner, and those to which he pays attention. Such factors are influen- 
tial in perceptual acquisitions, which are the genuine substance of place 
learning. Causal sequences of the environment are not all spatial. Some 

27 Krechevsky ( 1932a ) (1932b) (1933a) (1933b). 

28 We postpone until Chapter 12 consideration of the attempts of others to systema- 

2 expectancy theory. 

1 The parallel may be noted with Hulls interpretation of drive as activating habit 
igth into reaction potential. 

30 Tolman, Hall, and Bretnall (1932). 



202 THEORIES OF LEARNING 

are temporal; some are logical or systematic or arbitrary. The goal-object, 
by its presence or absence, verifies or refutes hypotheses. Hence the goal- 
object is essential for the establishment of some features of cognitive 
structure. Latent learning, which dramatizes a relatively unmotivated, 
incidental type of learning, is an extreme instance of the relationship of 
goal-behavior to learning and is probably not a typical case. 

The importance of something like emphasis is well brought out 
in the series of experiments by Muenzinger and his associates, 31 in which 
punishment or obstacles to be overcome, even on the side of the correct 
choice, aid learning. Whether or not punishment will be an aid to learn- 
ing thus depends upon what influence it has on the cognitions of the 
learner. It may be helpful as an emphasizer or harmful as a distractor. 

The principle in Tolman’s system that most nearly replaces reinforce- 
ment is the principle of confirmation. If an expectancy is confirmed, its 
probability value is increased; if an expectancy is not confirmed, its 
probability value is decreased (i.e., it undergoes extinction). Tolman 
distinguishes between confirmation and reinforcement in a passage de- 
scribing the building up of preferences for one restaurant over another: 

Such learning takes place, I believe, not primarily through reinforcement 
in the Hullian sense but merely by the repeated “confirming” experiences of find- 
ing what restaurants lead to what foods by what costs and with what degrees of 
gratification. 32 

The regulatory role of reward is not very different in Hull’s system 
from this role in Tolman’s, though the interpretations differ. According to 
fTolman, the learner acquires knowledge of a means-end relationship; 
according to reinforcement theory, he learns behavior which is followed 
’ by reward. In any case, it would be most misleading to interpret Tol- 
man s position as if reward influenced performance only, and not learn- 
ing. All that he insists upon is th^tJ^aQnng_and performance cannot be 
equated, and the role_ j)f .mo rivation in learning differs m jsome respects 
foraijtemle in performance. This conclusion, originally proposed by 
Lashley (iSS^J^can scarcely be doubted. 

VARIETIES OF LEARNING AND THEIR LAWS 
The 1932 Version 

Learning theorists commonly select one kind of learning problem 
or situation as typical and then proceed to develop a theory appropriate 
to this reference situation. Having constructed a set of principles in this 

31 Mueminger and others, 1904-1952. 

33 Tolman { 1952b), page 396. 
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way, they attempt to show by a logical process that other kinds of learn- 
ing are really at base like the typical one, and hence explicable in the same 
terms. Recognizing this tendency, Tolman selected for review three kinds 
of learning experiments, with their three corresponding doctrines. These 
were conditioned-reflex learning (Pavlov), trial-and-error learning 
(Thorndike), and inventive learning ( Kohler ) . He then gave a sign- 
gestalt "interpretation of each of the three kinds of learning as alternative 
to the usual theory associated with each. He found it useful to preserve 
the typical experiments, which represent a kind of hierarchy from stu- 
pidity to intelligence. The laws applicable to the more stupid situations 
have to be supplemented by additional laws for the higher forms of 
learning. 

In the 1932 version there are three groups of laws: capaci ty laws, 
laws relative to the nature of the material, and laws relative to die man- 
ner of presentation. 

1. Capacity laws. Only organisms can learn. It is evident, therefore, 
that what the organism can learn must depend on what kind of an or- 
ganism it is. That is the reason for capacity laws. 

The list of capacity laws is as follows: 33 

a. Formal means-end-capacities 

b , Discriminating and manipulating capacities 
,^c. Retentivity 

d. Means-end-capacities needed for alternative routes, detours, etc. 

% e. Ideational capacities 

f. Creative instability 

In order to learn conditioned reflexes, the learner must have the 
necessary capacities to form and act in accordance with “sign-gestalt- 
expectations ” That is, the conditioned stimulus serves as a sign that 
the unconditioned stimulus is about to appear and the conditioned be- 
havior is appropriate to the sign. This capacity is named a “means-end- 
capacity. ,> In his later writings Tolman has dropped a number of the 
hyphenated terms which made his book clear, entertaining, but also 
somewhat forbidding. In the list of laws which we have given above, his 
terms have been freely paraphrased to make for easier reading, although 
there may be some loss involved. In addition to the general capacity for 
sign learning, conditioning requires special capacities for discriminating 
and manipulating features of the environment. Finally retentivity is im- 
plied, if the results of earlier conditioning trials are to influence later 
ones. Only capacity laws a> b > and c apply to conditioning. 

The capacities needed for trial-and-error learning are the same as 

33 Tolman (1932b), pages 376-377. 
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those required for conditioning, except that additional means-end- 
capacities are needed because more alternatives are open to the learner. 
The field relationships of alternate routes, detours, final common paths, 
are involved. Additional capacities of ideational sort, permitting compari- 
son of alternatives (a mental “running-back-and-forth”), are probably 
helpful in trial-and-error learning. 

Inventive learning requires all the capacities of the other varieties of 
learning plus creative instability . This is a capacity to break out into 
new lines of activity which have never occurred to the learner before. 

The need for capacity laws seems evident enough, once they are pro- 
posed, though they have been neglected in most learning theories. Even 
Thorndike, strongly identified with the study of individual differences, 
neglects capacity laws in his learning theory. Such a statement as “Any 
response of which the organism is capable can become attached to any 
stimulus to which it is sensitive” 34 implies only Sensitivity capacity and 
response capacity/ and neglects any capacity to establish relations be- 
tween them. It would be grossly unfair to say that Thorndike did not 
recognize differences in learning ability, but it is true that he slighted 
the different kinds of capacities needed for different kinds of learning, 
because all learning was merely the forming of bonds. Hull, who also 
in one of his earlier research interests contributed to the psychology of 
individual differences, 35 only late in his career began to consider indi- 
viduality as something to enter into his learning theory. 36 

2. Laws relating to the nature of the material. In Tolman’s discus- 
sion of these topics, he calls attention to certain “gestalt-inducing- 
conditions” and suggests that they are of the sorts emphasized in gestalt 
studies of perception. 

The list follows: 37 

'a. Togetherness 

fc. 3-i S'hi] 

c . Other gestalt-like laws 

d. Interrelations among the spatial, temporal and other characters of the 
alternatives 

e. Characters in the material favoring new closures and expansions of the 
field 

In relation to conditioned-reflex learning, these laws suggest that 
there must be a “togetherness” of essential signs and their means-end- 
relationship to the thing signified. Tolman states that this is about what 

34 Thorndike ( 1913b), page 15. 

33 Hull ( 1928). 

Hull (1945). 

37 Tolman (1932b), pages 378-385. 
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Thorndike has called “belongingness.” Tolman adds a somewhat similar 
law of fusibility of sign, slgnificate, and signified means-end-relationship 
by which he means a certain naturalness about the situation which makes 
it easier to form a gestalt of the whole. He provides in a third law for the 
possibility „ of new discoveries. He adds one law (d) for trial-and-error 
learning and one for inventive learning (e), to suggest that some arrange- 
ments must be easier than others. He points to Kohler’s observation that 
the ape could learn to use the stick more easily to rake in the food if 
the stick and food were perceived together. 

These laws show great catholicity, but there is no ordering principle 
among them. “Spatial, temporal, and other” can scarcely be said to ar- 
range things dimensionally. They do make a definite bow to the important 
fact that perceptual principles must be understood if the relative ease 
or difficulty of problematic situations is to be made comprehensible. 

3. Laivs relative to the manner of presentation . These are the laws 
inherited largely from association psychology, the ones for which abun- 
dant evidence can be found recorded in M cGeoc h^and Irion’s book 
(1952). The list is as follows: 38 

a . Frequency, recency 

b. Revival after extinction, primacy, distributed repetition, etc. 

c. Motivation 

d. Not “effect” but “emphasis” 

e. Temporal orders and sequences in the presentation of alternatives 

/. Temporal relations between the presentation of certain of the already 
given alternatives and the true solution 

Of these, the first four belong to conditioned-reflex learning, the 
first five to trial-and-error learning, and all six to inventive learning. 

The principles of frequency and recency are accepted in the follow- 
ing form: “The more frequently and more recently the actual sequence 
of sign, means-end-relation and significate have been presented, the 
stronger, other things being equal, this resulting sign-gestalt will tend 
to be.” 30 That is, only in a situation favorable to sign-gestalt formation 
will frequency be effective. The other laws provide opportunity again 
to raise the question of the law of effect, and to make some observations 
descriptive of favorable conditions in trial-and-error and multiple-choice 
experiments. 

As a set, the laws are rather disappointing. They serve as a useful 
reminder of the main tenets of the point of view and of its criticisms of 
prevailing doctrines. They leave much to be asked for on the positive 

38 Tolman ( 1932b), pages 385-389. 

39 Tolman ( 1932b), page 386. 
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side in their lack of sufficient precision of statement so that they can be 
called true or false. 

Leaving the laws in this form makes everything a matter of cor- 
relations between situation and behavior and does not get at the formal 
problem of rigorous definition and measurement of intervening variables. 
This lack has not yet been made up, in spite of later reworkings of the 
list of laws. 


The 1949 Version 


New sets of laws have been proposed by Tolman from time to time. 40 
The “laws” do not have the centrality within his system that Hull’s postu- 
lates have within his, so the revision of the list of laws requires little re- 
interpretation of earlier experimentation. For purposes of understanding 
the restatement of the theory, however, it is useful to review one of these 
later revisions. The one selected for review appeared 17 years after the 
first 41 


In this later paper, Tolman distinguished six “types of connections 
or relations” that get learned, and then proceeded to indicate what was 
known about the “laws” applying to each. 

1. Cathexes. Tolman borrowed the word “cathexis” (plural, cathexes) 
from psychoanalytic theory, where i t refers metapho rically to the energy 
g ^rge^mfff'which an activity is in vested. He gives it a related but more 
restricte<Tme^^ to Lewin’s valence. A cathex is is 

an acquired relationship between a drive and an object. If a hungry 
cfiil3Tj3«fnts a banana, you knolv^at the child has a learned cathexis 
its hunger drive and a banana as a preferred goal-object. This 
is a positive cathexis, for the child is led to seek bananas. A negative 
cathexis works the other way. If a burned child fears a hot stove his 
fear is connected, as a negative cathexis, to the stove, and the cathexis is 
negative because he shuns the stove. 

Tolman believes that drive reduction (reinforcement) is probably 
valid as an explanation for the acquiring of cathexes, but he believes we 
need more experimental evidence. 

Both positive and negative cathexes, he believes, are very resistant 
to forgetting. 


2, Equivalence beliefs. This is Tol man’s substitute for second ary re- 
rcement, whereby an organism reacts to a subgoal (or a subdisturb- 
i, in the case of negative cathexes), as it would in the presence of 


40 Tolman (1934) (1937) (1949a). 
« Tolman (1949a). 
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the goal or actual disturbance. In social behavior, and especially in the 
psychological clinic, we see many evidences of faulty equivalence be- 
liefs. These are often engendered by traumatic experiences. Hence Tol- 
man concludes that the reinforcement principle is valid for equivalence 
beliefs as for cathexes, but it needs to be supplemented by an awareness 
of the importance of traumatic experience. 

3. Field expectancies . These are the rechristened sign-gestalt- 
expectations of the earlier system. They permit the organism to take 
shortcuts, roundabout routes, and make possible latent learning. Tolman 
reasserts his contention that reinforcement per se is not valid in the ex- 
planation of field expectancies, although he acknowledges the importance 
of motivation. 

The presence of reinforcement in a particular locus makes the locus a goal 
which determines what performance will take place but it does not stamp in S-R 
connections though it probably does give a special vividness to that locus in the 
total field expectancy. 42 

Although we know little at present about laws of the formation of 
field expectancies, when we do work them out we shall have to take into 
account perceptual sensitivity, memory, and inference abilities. 

The forgetting of field expectancies is probably more rapid than the 
forgetting of cathexes and equivalence beliefs.. Such forgetting, Tolman 
believes, probably follows the sorts of laws the gestalt psychologists have 
uncovered. 

^ x 4. Field-cognition modes . These modes are of higher order than the 
field expectancies. A field-cognition mode makes a given expectancy- 
formation possible. The mode is based in part upon innate capacities and 
in part on previously established field expectancies. Hence a field- 
cognition mode represents a disposition or readiness^ to acquire field ex- 
pecta ncies of o ne of three varieties: perceptual, memorial, or inferential. 

Tolman has no laws to suggest. Such laws will arise out of at present 
non-existent types of transfer experiment. 

J5. Drive discriminations . Tolman here refers to experiments such as 
those of Hullj£1933) and L eeper (1935), in which the animal learned 
to go one way when hungry, another way when thirsty. Tolman has no 
laws to suggest. 

6. Motor patterns . Because his behaviorism is not of the stimulus- 
response variety, Tolman recognizes that he needs some special principles 
governing the acquisition of motor patterns. For these, Tolman was in 
1949 prepared to accept Guthrie s simple conditioning explanation. 

42 Tolman ( 1949a), page 151. 
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It is evident that these laws have not moved toward precision over 
the initial set, Tolman states his own estimate of what he has done here: 

And, although, as usual, I have been merely prrcrrm^ptT and have not 
attempted to set up, at this date, any precise system of postulates and deduced 
theorems, I have made some specific suggestions as to some of the conditions and 
laws for the acquisition, de-acquisition, and forgetting of these relationships. 4B 

While Tolman has made very thoughtful observations on learning, 
and has been responsible for much original experimentation, he has not 
proposed a system clearly enough defined so that successive approxi- 
mations succeeded in making it more exact and testable. We find, in 
this statement of six kinds of learned relationships, two explained by re- 
inforcement (Hull, Skinner), one explained by simple conditioning 
(Guthrie), and three not explained at all. This is characteristic of Tol- 
man’s role. In addition to making concessions to othe“r interpretations, 
he has consistently pointed out relationships difficult for stimulus- 
response theorists to explain, without himself being able to offer much 
in the way of detailed explanation for these discovered relationships. 

EXPERIMENTS ON LATENT LEARNING 

Reinforcement theory proposes that habits are strengthened by rein- 
forcement; sign-learning theory says that expectancies become estab- 
lished, and that performance is secondarily correlated with expectancy. 
Experiments on latent learning have been used all along to argue against 
reinforcement as an essential principle of learning. 44 The arg uments 
against reinforcement theory had more cogencyjbefore jlulLmodified 
his theory to minimize the amount of necessary reinforcement and to em- 
phasize the factors activating habit into performance. All that can be 
said with confidence, now that Hull has quoted the original latent- 
learning experiments as evidence in support of his predictions, is that 
the details of latent-learning experiments place reinforcement theory 
under strain. But it is equally true* that the details of latent-learning 
experiments likewise place cognitive theories under strain, for an ade- 
quate theory (whether a reinforcement theory or a sign-learning theory) 
has to account for both the positive and the negative results with 
which the literature abounds. Because the latent-learning experiment 
has proved such a fertile area for testing theories, it is* worth examining 
the evidence in some detail. 

43 Tolman (1949a), page 154. 

44 E.g., Tolman (1938b); Thistlethwaite (1951b). 
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Tolman and Honzik’s Experiment 

Following up the work started by Blodgett (192 9), Tolman and 
gonzik„ ( 1930b ) studied the effect of introduction of reward in a Tat- 
maze experiment after the animals had run several days without food. 
The control group, fed each day in the maze, reduced their error and 
time scores much more rapidly than the non-led group, but when food 
was introduced for the latter group, error scores and time scores abruptly 
became alike for both groups. /Thus the non-fed group had apparently 
profited as much by its earlier trials as the fed group. Since this profiting 
did not show in performance, the learning taking place is said to be 
"latent.” The results for error elimination are shown in Figure 29. 



FIGURE 29. Evidence for latent learning in the maze. With no food reward there 
is some reduction in errors, but not as great a reduction as with regular food reward. 
Despite the higher error scores prior to the introduction of food, the group rewarded 
only from the 11th trial immediately begins to do as well as the group that had been 
regularly rewarded. The interpretation is that some learning went on within the first 
10 trials which did not show in performance until the food incentive activated it. 
After Tolman and Honzik (1980b). 

There were three groups: two control groups, one rewarded through- 
out, the other non-rewarded throughout, and the experimental group, 
non-rewarded until the eleventh trial. On the twelfth day, the experi- 
mental group, having been fed but once in the maze, made as few 
errors as the group which had received food in the maze each of the pre- 
ceding days. 

There are several comments to be made on this situation, in its bear- 




210 THEORIES OF LEARNING 

ing on other theories as well as Tolman’s. The maze used was a 14-unit 
one of multiple-T type, 45 arranged with doors between each unit to pre- 
vent retracing. The rat without food at the end still progressed through 
the maze, so that, according to such a theory as Guthrie's, learning con- 
ditions were ideal. The rat had no opportunity to unlearn what it last 
did in each segment, which was of course to go through the door to the 
next one. The results are critical of Guthries theory not because the rat 
shows latent learning but because it does not show enough learning 
when there is no food at the end of the maze. (It is to be recalled that 
the Guthrie and Horton cats often did not eat when they left the prob- 
lem-box; their behavior was said to be fixed because they left the box. ) 
The Guthrie explanation is not sufficient, because the fed rats learned 
much better than the unfed ones. 

Reinforcement theories, in their older form, were put to it to ex- 
plain latent learning. In the older forms of “law of effect” theory, the 
presumption was that reinforcement worked directly upon response 
strength, so that all that was learned should be revealed in performance. 
Reinforcement theorists usually made much of the decrease in errors 
during non-rewarded trials in latent-learning experiments as evidence that 
\$ome reinforcement was present before reward was introduced. But they 
could really only account for that much gain by reinforcement, any pre- 
viously concealed gain, shown when reward was introduced, being 
unaccounted for. In Hulls later theory, however, in which habit strength 
(sHr) is acquired independently of the magnitude of reinforcement, this 
difficulty no longer holds. Under minimum values of incentive magni- 
tude K (such incentives as are present without food reward, including ex- 
ploration, return to home cage, etc. ) habit strength will be practically 
“latent,” to be revealed in reaction potential when large values of K' 
are created by introducing the food reward. 

Tolman s explanation is that the non-reward situation was a good one 
for learning the spatial relations of the maze. Every unit had one dead 
end and another end with a door. The last thing done in each case was 
to go through the door. Recency (which is accepted as favoring sign- 
gestalts) would strengthen the cognition that the door was the way from 
one segment to the next, though under non-reward conditions there 
were no reasons for the rat to show what it “knew.” The substation of 
the food at the end of the maze, a highly demanded goal-rjbject, led 
the rat to use its cognitive map, to take the turns which led trom one unit 
to the next Hence the sudden reduction in errors. Th the group with 
46 Stone and Nyswander { 1927 ). ; 
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late introduction of reward had a tendency to do even better than the 
regularly rewarded group may conceivably have been due to a more 
thorough exploration of the maze in the early trials by the non-rewarded 
group, with consequent better orientation when finally running toward 
the goal. 

Five Varieties of Latent-Learning Experiment and Their 

Results 

An upsurge of interest in latent learning led to many experimental 
reports, of which 48 have been classified by MacCorquodale and Meehl 
into five categories. 16 Their classification is followed here, as well as 
their scoring of the experiments as yielding positive or negative results. 

1. Unrewarded trials with later introduction of relevant reward. 
This is the type of experiment that Blodgett performed and Tolman and 
Honzik followed up. Of nine studies cited, seven are interpreted as 
positive, two as negative. 47 

2. Free exploration followed by relevant reward . The animal is 
permitted to run around in the maze for some time. Then it is either 
fed in the goal-box prior to the test runs, or runs the maze as usual to 
food at the end. In either arrangement a control group is necessary for 
comparison. Of five experiments of this kind, all were positive. 48 

It has been reported that in free exploration animals tend to reduce 
their entrance into culs, thus producing latent learning through a kind 
of “satiation” of blind-alley entrances. 46 But the problem turns out not 
to be that simple, for more recent workers have found the most success- 
ful latent learners to be the ones that made the largest number of cul 
entries during exploration. They did not decrease their cul entrances 
during exploration. 50 

3. Location of incentives learned under satiation and tested under 
relevant drive . In this third variety of experiment, animals satiated for 

46 MacCorquodale and Meehl ( 1954), pages 199-213. The last of the latent learning 
experiments cited is dated 1952. These writers leaned heavily upon a review by 
Thistlethwaite ( 1951a), both for earlier experimental citations and for the major classi- 
ficatory scheme. Thistlethwaite’s review led to two replies, Kendler (1952) and Maltz- 
man (1952), to which he responded, Thistlethwaite (1952a). 

47 Positive results: Blodgett ( 1929), Elliott ( 1929), Herb ( 1940), Simmons ( 1924), 
Tolman and Honzik (1930b), Wallace, Blackwell, and Jenkins (1941), and Williams 
(1929). Negative results: Meehl and MacCorquodale (1951), Reynolds (1945b). 

48 Positive results: Buxton (1940), Daub (1933), Haney (1931), Karn and Porter 
( 1946), Lashley (1918). 

49 MacCorquodale and Meehl { 1951 ). 

50 Kimball, Kimball, and Weaver (1953). Unfortunately, they used different alley 
widths and younger rats than MacCorquodale and Meehl. 
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food and water run a T-maze or a Y-maze with food and water in oppo- 
site arms, or in one arm. Then, under test conditions, the “hungry” or 
“thirsty” animal demonstrates latent learning if it can choose the appro- 
priate turn to reach the relevant incentive. Of nine reports of this variety, 
seven yielded positive results, two negative results.' 1 

To explain this kind of learning, the reinforcement theorist falls back 
upon Hulls fractional antedating goal response (r G ). This is, of course, 
the reinforcement theorist’s equivalent of “cognition.” 52 The issue, when 
a reinforcement theorist invokes r G> is not over whether or not the “ex- 
pectancy” language is appropriate, for such language is always appro- 
priate when fa's are involved. The issue is over the scientific precision 
and fertility of the “expectancy” interpretation as contrasted with the 
r G one. Because r G ? s have the dimensions of ordinary responses they can 
be treated with a quantitative precision. Because cognitions lack such 
dimensions, similar treatment is not feasible. Critics of the r G concept 
do not deny the possibility of precision, but they believe that pre- 
cision in the use of r G has not been achieved, and that some of the sug- 
gested uses of r G are faulty. For example, if there are cases in which 
palpable r G ’s do not exist, then the r G theory is insufficient. 

4. Location of incentives learned under strong irrelevant ( com- 
peting ) drives. If food is present in one arm of a T-maze and water in 
the other, a thirsty animal, through a forced-run procedure, sometimes 
encounters the dry food, without eating it, and sometimes encounters the 
water, and drinks it. Under these arrangements, will the animal, if made 
hungry, run to the side where food had been encountered? This is the 
only variety of experiment in which the negative results outweigh in 
numbers the positive ones. Of eighteen studies, only seven are positive, 
and eleven are negative. 53 

These experiments point to complexities that call for detailed analy- 

51 Positive results: MacCorquodale and Meehl (1951), Meehl and MacCorquodale 
(1948), Seward, Levy, and Handlon (1950), Spence, Bergmann, and Lippitt (1950), 
Spence and Lippitt (1940), Szyraanski (1918), Thistlethwaite (1951b). Negative re- 
sults: Kendler (1947a), Maltzrnan (1950). Kendler and Levine (1953) have added 
some positive results. 

52 A defense of the r G concept in the interpretation of irrelevant motivation-incentive 
experiments can be found in Spence, Bergmann, and Lippitt (1950), Kendler, Karasik, 
and Schrier (1954). Some difficulties with the n> concept are pointed out by Thistle- 
thwaite (1951b). See also Heron (1949). 

5S Positive results: Bendig (1952), Christie ( 1952), Diesenroth and Spence (1941), 
Strange (1950), Thistlethwaite (1952b), Walker (1951), Walker, Knotter, and De- 
Valois (1950). Negative results: Christie (1951), Fehrer (1951), Gleitman (1950), 
Grice (1940), Kendler (1947b), Kendler and Mencher (1948), Kendler and Kanner 
( 1950),.jLittman ( 1950), Shaw and Waters (1950), Spence and Lippitt (1946), 
Walker (1Q48). * 
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sis of what is happening. There is an asymmetry, for example, between a 
hunger-food sequence and a thirst-water sequence. One might suppose 
that a hungry and thirsty animal would be more likely to drink water than 
to eat dry food. Yet Heron ( 1949 ) found that hungry and thirsty animals 
chose the food side in a familiar apparatus in 100 per cent of their choices, 
where the other side led to water! Animals that were satiated, that is, 
that had all they would eat or drink, chose the water side in 92 per cent 
of the trials. Heron’s conjecture that perhaps the choice was made on the 
basis of full or empty stomach, rather than on the basis of drive discrimi- 
nation, was supported by the choices of animals whose stomachs were full 
of water. They, like the food-and-water satiated animals, chose the water 
side, insofar as they made any choices at all. 

Another complexity is introduced because under strong drive some of 
the animals have to be forced down the alley leading to the irrelevant 
incentive. Walker, Knotter, and De Valois (1950) showed that animals 
that had to be forced demonstrated latent learning less frequently than 
animals not forced. They showed also that animals with strong position 
preference during the early runs were less likely to show latent learning in 
the test runs. 

When all of these complexities are unravelled, the focus of die con- 
troversy over theory may very well change, just as it has already changed 
from the existence of latent learning to the circumstances under which 
latent learning occurs. 

5. Exploration of paths leading to distinctive goal-boxes , followed 
by prefeeding in one goal-box prior to the test run . This design differs 
from the second type ( exploration of a maze ) in that discriminable goal- 
boxes are used. It is in reality a variant of Maier’s (1929) reasoning ex- 
periment, in which many of the same arrangements are involved. Hence 
Maier’s experiment, and the successful repetitions of it, give some addi- 
tional supporting evidence. The typical experiment with this design uses 
a T-maze with two easily distinguishable end-boxes. After the rat is 
familiar with the arms of the maze and the end-boxes to which these lead, 
it is fed in one of the end-boxes. It demonstrates latent learning if, when 
placed at the entrance, it chooses the side leading to the appropriate 
goal-box. Of seven studies of this kind, four are classified as positive, 
three as negative. 54 

What conclusions can we draw from this array of contradictory 

54 Positive results: Gilchrist (1952), Iwahara and Marx (1950), Seward (1949), 
Tolman and Gleitman (1949). Negative results: Denny and Davis (1951), Seward, 
Datel, and Le\y (1952). (In the last study, one of three replications came out posi- 
tive. ) 
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evidence? So far as sheer "score” is concerned, of the studies reviewed 
by MacCorquodale and Meehl, thirty were reported as positive, eighteen 
as negative. There is no longer any doubt but that, under appropriate 
circumstances, latent learning is demonstrable. 

Other Varieties of Latent-Learning Experiment 

Because the possibilities of learning under irrelevant motivation- 
incentive conditions are enormous, the five foregoing experimental varie- 
ties are not exhaustive. 

An ingenious and novel test of latent learning has been proposed by 
Muenzinger and Conrad (1954). They suggest that latent learning can 
be tested by its interference with a novel task as well as by its accelera- 
tion of learning in the original task. The interference ("negative trans- 
fer”) method permits a distinction between two phases of latent learning. 
In the first phase the non-rewarded animal becomes "maze-wise” without 
learning the true path. If in this phase the animal is moved to a new 
maze that is the mirror-image of the first, and run with reward, he shows 
positive transfer, that is, learns more successfully than control rats with- 
out the prior non-rewarded experience. In the second phase, “true-path” 
learning occurs, and produces negative transfer when the rat is run in 
the mirror-image maze with reward. This second phase demonstrates 
genuine latent learning of the true path. In the reported experiments the 
first phase lasted for 10 trials (five on each of two days), and the last 
phase began only on the third non-rewarded day. If the reward is intro- 
duced earlier (on the first or second day) true-path learning sets in 
earlier, and negative transfer is demonstrable in the mirror-image maze. 55 
These findings suggest several important supplementations to the previous 
interpretation of latent learning — both as to what is happening in the non- 
rewarded runs and what happens when the reward is introduced. 

There has been a long history of experimentation on “incidental 
learning” in human subjects, that is, on the learning of something tan- 
gentially related to an ongoing task upon which attention is directed. 
These experiments have recently been assimilated to latent learning, for 
they are appropriately classified as learning under low degrees of rele- 
vant motivation, such learning being followed by more direct testing. For 
example, Stevenson (1954) finds striking increase in latent-learning 
ability with age during the preschool years, Postman and Thuma (1954) 

55 Kanner (1954) also shows that rewarded animals learn somewhat more than un- 
rewarded ones. 
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distinguish between the subject's recognition of the presence of the inci- 
dental stimuli and his recall of their location . 

Following their review of the latent-learning literature, MacCor- 
quodale and Meehl conclude: 

In spite of the preceding difficulties of interpretation, it seems safe to say 
that the current state of the evidence is at least onccnrair'ng to the theorist 
oriented to some form of expectancy theory. We were, frankly, somewhat more 
impressed by the overall trend of the evidence than we had expected to be. 5t? 

They have suggestions for both the major competing theories. First, 
they say, cognitive theorists have rested too often with attempts to con- 
found the opposition by posing the question “How do you explain that 
one?" The cognition theorist must take seriously the accounting for nega- 
tive results as well as taking satisfaction in the positive results. Second, 
reinforcement theorists have tended to use an “elastic" explanation based 
on fc* They must use the r G concept rigorously if they wish to assert the 
superiority of their type of explanation. 

ESTIMATE OF TOLMAN’S SIGN LEARNING 

Tolmans Position on the Typical Problems of Learning 

Because it is a system with some aspirations toward completeness, 
there are statements within Tolmans writings relevant to most of the 
problems raised by other writers. 

1. Capacity . Tolman has recognized the need for capacity laws. The 
matter interests him chiefly because of the possible gradation of learn- 
ing tasks from those requiring least to those requiring most intelligence. 
It is natural that one who makes predictions about what animals will do 
in problem-solving situations is confronted with the limitations of one 
organism as compared with another. Tolman believes that the high de- 
gree of specificity of capacities in the rat is due to the lack of influence of 
a culture which prizes certain behaviors over others. Hence one of the 
contributions of animal studies may be to show processes at a subcultural 
level. 57 

2. Practice . The law of exercise is accepted in the sense of the 
frequency with which the sign, the sxgnificate, and the behavioral rela- 
tion between the two, have been presented. Exercise is not the cause of 
the initial selection of the right response. Mere frequency without *1>e- 

68 MaeCorquodaie and Meehl (1954), page 213. 

67 Tolman (1945). 
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longing” does not establish a connection. After a response has been 
learned, overexercise tends to fix it, making it unduly resistant to 
change. 58 

3. Motivation. Rewards and punishment tend to regulate perform- 
ance, rather than acquisition, although they are related to acquisition 
also because they serve as “emphasizers” and because goal-objects con- 
firm or refute hypotheses. Because of the demonstration of latent learn- 
ing, the law of effect in its usual sense (reward as a strengthener of 
response tendencies) is not accepted. 

4. Understanding. Cognitive processes are of the very essence of 
molar behavior and learning. Hence Tolman is friendly to learning by 
creative inference, inventive ideation, and so on. He repeatedly states, 
however, that he does not wish to imply 4 mtrospectively get-at-able 
conscious contents.” The prototype of learning is sensible, reasonable ad- 
justment according to the requirements of the situation; stupid learning 
occurs as a limiting case when the problem is unsuited to the learner’s 
capacities or is set up in inaccessible form. Insightful learning is not 
limited to the primates; it is characteristic of rat behavior as well. 

5. Transfer . The problem of transfer of training as such has been 
of relatively slight interest to those experimenting with animals. To 
some extent all the experiments on change of reward, change of drive, 
place learning, and latent learning are experiments on problems related 
to transfer, that is, the ability to use something learned in one situation 
in relation to another. All cognitive theories expect a large measure of 
transfer, provided the essential relationships of the situation are open 
to the observation of the learner. 

6. Forgetting. Having earlier experimented in the field of retroac- 
tive inhibition, 59 it is probable that Tolman is friendly toward some theory 
of retroactive inhibition, and he has indicated that he accepts the 
Freudian mechanism of repression. 60 

He has also asserted the resistance of cathexes and equivalence be- 
liefs to forgetting, and the susceptibility of field expectations to the 
kind of forgetting emphasized by gestalt psychologists. 

Molar Behavior as a Field Concept 

The student of molar behavior contends that there can be a psy- 
chological science or behavioral science in its own right, not waiting for 

68 Krechevsky and Honzik ( 1932). 

88 Tolman (1917). 

60 Telman ( 1942), pages 63-64. 
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its progress upon advances in other sciences. While there is some con- 
troversy about this, it is a position which can be defended by both 
stimulus-response and cognition psychologists. Hull too, was able to 
adopt a concept of molar behavior by this definition. 

Beyond the matter of level of discourse there lies the question as 
to whether molar behavior is essentially a field concept. The question 
might be put another way. To what extent is Telman’s sign-gestalt theory 
ajriio sc-trT py\ choloeyP^Koftka, in his review of Tolman s book, wel- 
comed the friendliness to gestalt, but deplored some limitations which 
he detected in Tolman’s variety of gestalt. 61 He believed that the distinc- 
tion between molar and molecular did not go far enough, because the 
reality of the molecular was acknowledged. Koffka would wish field 
principles applied to physiology as well as to psychology, so that all 
explanatory concepts should be molar.^ Furthermore, Tolma n’s pre occupa- 
tion with beh avior sequen ces and historical interpretations makes sign- 
gestalts only a limited illustration of the variety of possible gestalts. 

This disadvantage, from Koffka’s point of view, makes it easier to 
compare Tolman’s experiments with those done under the influence of 
other theories. Since the experiments are done with rats in mazes and 
discrimination situations, they are directly comparable to the experi- 
ments in the typical American animal laboratory. Therefore Tolman chal- 
lenges prevailing conceptions such as those of Thorndike and Guthrie 
and Hull more vividly than the more orthodox gestalt writers, whose 
situations are often so different as to be incommensurate. 

Tolman makes so many generous acknowledgments to Lewin that 
there is no doubt of the affiliations between their systems, as third parties 
have made clear. 6 - Their differences became adjusted in a brief and 
friendly controversy. 63 

The conception of molar behavior is consonant with prevailing con- 
ceptions of science, in which some degree of arbitrariness is recognized 
in the abstractions to be made from the totality of natural events for 
the purposes of any given science. Natural laws can be formulated in 
many different ways to cover aspects of occurrences. Tolman has un- 
doubtedly done a service to psychology in joining with those who see 
the importance of a “psychological” psychology, that is, one whose con- 
cepts are appropriate to the level of its descriptions and predictions. 

Koffka (1933). -1? 

^E.g., White (1943). U ' 

33 Tolman (1932a), Lewin (1933b). For his appreciation of Lewin, see Tolman 
(1948a). 
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The Status of Intervening Variables 

The distinction between performance and learning, which must be 
accepted in one form or another, requires that learning be inferred from 
performance. These inferences are always being made, even in the most 
“objective” sorts of observations, because only “relevant” physical or 
physiological occurrences are recorded. There is jiothing especially new 
about Tolman’s intervening variable, except that he points out clearly and 
insistently that the intervening variable is there, and is not out of place 
in a behavioral science. The logical mistake is easily made of supposing 
that the physical and physiological terms to which the intervening varia- 
bles are anchored are themselves independent of theories, which they 
are not. The kinds of experiments which are performed, the kinds of 
measurements which are taken, always involve selection by the experi- 
menter. The data, even though reproducible, arenat“pure” facts of 
nature. The question may therefore be raised whether the intervening 
variables are going to be anything not found in the experimental rela- 
tionships directly. It is doubtful whether a satisfactory answer can be 
given until the intervening variables are identified with quantities. If 
there are derived constants, interchangeable from one situation to an- 
other, then the intervening variable becomes a scientific construct ofsome 
importance. 64 

As previously pointed out, Hull has adopted the intervening vari- 
able also, so that the logic is common to the differing points of view. 
The remaining difference lies in the sorts of intervening variables chosen. 

A distinction has been made by MacCorquodale and Meehl (1948) 
between two kinds of symbolic intermediaries: “intervening variables” 
and “hypothetical constructs.” A pure “intervening variable” is a mere 
convenience in system-making, and has no properties or dimensions other 
than those specified in the equations that define it. A “hypothetical con- 
struct” is an entity with properties attributed to it beyond those of the 
functional equations in which.it. appears. Hulls r G is an example of a 
hypothetical construct because it has all the properties of a muscular 
response and is said to give rise to proprioceptive stimuli (s G ) as any 
muscular response does. While it is hypothesized, rather than observed, 
it is potentially observable and need not always have the status of a 
metaphor. Alth^^gh Tolman s system appears to be built around “inter- 
vening variables in the MacCorquodale and Meehl sense (demands, 
cognitions, etc.), Tolman denies this and asserts not only. that all theories 

Seward (1955 ). 
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use both types of intermediary, but that hypothetical constructs are 
desirable. 05 

The distinction made by MacCorquodale and Meehl is a useful one, 
but it is a mistake to jump to the conclusion (which they did not sug- 
gest) that one of the types of intermediary is invariably preferable to the 
other. Possibly what we need is a clearer specification of the rules under 
which each should be used. Consider the gene theory in biology. If the 
gene is believed to be “a something” that has its place on the chromo- 
some, it is a hypothetical construct, and its added properties are such as 
may be revealed eventually under the microscope or by chemical anal- 
ysis. If it were merely a pawn to be used in the study of the statistics 
of unit characters and their genetic transmission, the gene would be a 
pure intervening variable. In this case, the status of hypothetical con- 
struct has seemed to be a useful one, as one considers chromosome maps, 
crossovers, and so on. The gravitational constant (g) used in the law 
of falling bodies is another kind of intervening variable, for it is merely 
a constant defining the acceleration of a free-falling body under speci- 
fied conditions, with no other status beyond that. It is a convenience in 
harmonizing free-falling bodies, balls rolling down inclined planes, and 
the period of a pendulum, and as such justifies itself even though it has 
no independent existence as an entity. Precisely what kinds of intervening 
variables prove useful in psychology will depend upon what we can do 
with them. The test comes in their clarity, specificity, and fertility in 
system-making. If a hypothetical construct is used, its surplus meanings 
should be reasonable, and ideally a search should go on for its inde- 
pendent verification. 

What Kind of Behaviorism? 

When Tolman announced his purposive behaviorism 60 American 
psychology was still excited over the new behaviorism of Watson. It 
was Tolman’s^contribution then to show that a sophisticated behaviorism 
can be cogmgjant of all the richness and variety of psychological events, 
and need jfct be constrained by an effort to build an engineer’s model of 
the learning machine. 

V^ith the diversification of behaviorism under the influence of Tol- 
man jtnd others, the old brittleness of Watsonian behaviorism has largely 
disappeared, and what virtues there are in the behavioristic position 

Tolman (1951b), pages 282-283. For a further discussion of the issues, see Marx 
(1951b), Ginsberg (1954), Maze (1954). 

Tolman (1922). His system was already ten years old when his major book ap- 
peared. 
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have now become part of thejunderlyingjassumptions of most American 
psychologists — without most of them thinking of themselves as behavior- 
ists at all. 67 

Tolman s catholicity and his friendliness to new ideas have prevented 
his developing a “tight” or “elegant” system, even though he has all along 
made suggestions as to what such a system would be like. He has kept 
his prominent place in the forefront of learning theorists by his sensi- 
tivity to important problems, by his inventiveness in experimentation, and 
by keeping others on their mettle. 
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final note 

At the time of writing, the most ambitious attempt of Tolman to provide a 
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constructs, is the following: 
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The reader interested in the further outreaches of his thinking will find much 
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Tolman s self-appraisal can be found in his autobiography: 
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For a thorough review of Tolmans position, with some suggestions as to 
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Chapter 7 

CLASSICAL GESTALT THEORY 


D URING the first quarter of the century in America the quarrels 
within academic psychology lay chiefly inside ’the framework of 
association psychology. Structuralism, functionalism, and behaviorism 
were all members of the association family. A few dissident voices, such 
as Freud and McDougall, got little hearing. This complacency was dis- 
turbed by the new gestalt doctrine which influenced American learning 
theories chiefly through the appearance in English of Wolfgang Kohler s 
Mentality of apes (1925) and Kurt Koffka’s Growth of the mind (1924). 
The theory had been developing in Germany since it was first announced 
by Max Wertheimer in 1912, but these books, and the visits of Kohler 
and Koffka to America about the time of their publication, brought the 
new theory vividly to the attention of American psychologists. 

Koffka’s book had an important effect upon American learning theory 
J because of its detailed criticism of trial-and-error learning as conceived 
by Thorndike — a thrust at the very heart of the currently popular theory. 
The vigorous attack upon Thorndike (and upon behaviorism, although 
Thorndike was not, strictly speaking, a member of the school) was sup- 
ported by Kohler's well-known experiments on apes, described in detail 
in his book which appeared close in*time to Koffkas. Kohler s book brought 
the notion of insightful learning into the foreground, as an alternative 
to trial and error. He showed how apes could obtain rewards without 
going through the laborious processes of stamping out incorrect responses 
and stamping in correct ones, as implied in Thorndike's theories and as 
displayed in the learning curves of Thorndikes cats. Apes could use 
sticks and boxes as tools; they could turn away from the end of the 
activity toward a means to the end. 

Kohler's experiments with apes were done in the years 19X3-1917, 
on the island' of Tenerife off the coast of Africa, His book about these 
experiments 1 appeared in English a few years later and immediately was 
i Kohler (1917). 
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widely read and quoted. Two main series of experiments interested the 
American psychological public in the problems of insight. These were 
the box problems and the stick problems. 

In the single-box situation, a lure, such as a banana, is attached to 
the top of the chimpanzee’s cage. The lure is out of reach but can be 
obtained by climbing upon and jumping from a box which is available in 
the cage. The problem is a difficult one for the chimpanzee. Only Sultan 
(Kohler’s most intelligent ape) solved it without assistance, though six 
others mastered the problem after first being helped either by having 
the box placed beneath the food or by watching others using the box. 
The problem was not solved by direct imitation of others. What watch- 
ing others use the box did was to lead the observer to attempt to use the 
box as a leaping platform, but sometimes without making any effort 
whatsoever to bring it near the lure. When the problem was mastered, a 
chimpanzee alone in a cage with box and banana would turn away from 
the goal in order to seek the box and to move it into position. This “de- 
tour” character of insightful behavior is, according to Kohler, one of its 
important features. 

The box-stacking problem, requiring that a second box be placed 
upon the first before the banana can be reached, is much more difficult. 
It requires both the incorporation of the second box into the pattern of 
solution, and a mastery of the gravitational problem of building a 
stable two-box structure. While the emphasis in secondary accounts of 
Kohlers work is usually upon the intelligence which his apes displayed, 
he himself is at pains to account for the amount of apparent stupidity. 
In the box-stacking experiment, for example, he believes that the apes 
have shown insight into the relationship of “one-box-upon-another,” but 
not into the nature of a stable two-box structure. Such physical stability 
as was achieved in later structures was essentially a matter of trial and 
error. 

The stick problems required the use of one or more sticks as tools 
with which to rake in food out of reach beyond the bars of the cage. 
The beginning of insight occurs as the stick is brought into play; al- 
though often unsuccessfully, as when it is thrown at the banana and lost 
Once it has been used successfully, it is sought after by the chimpanzee 
and used promptly. The most dramatic of the stick-using experiments 
was in a problem mastered by Sultan, in which eventually two sticks 
were joined together after the manner of a jointed fishing pole in order 
to obtain a banana which could not be reached with either stick alone. 
The process was a slow one, and the first placing of the sticks together 
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appeared to be more or less accidental. Once having seen the sticks in 
this relationship, however, Sultan was able to “get the idea” and to 
repeat the insertion of one stick into the end of the other over and over 
again. 

Although the attack by Kohler and Koffka was chiefly upon Thorndike, 
it came at a time when American psychology was in the grips of a confident 
but somewhat sterile behaviorism. It is hard to see at this distance why such 
a common-sense and familiar notion as insight in learning should have 
created such a stir. But at the time Watsonian behaviorism had, in fact, 
won support for a fairly “hard-boiled” view of learning, according to 
which the organism was played upon by the pushes and pulls of the en- 
vironment and reacted in ways essentially stupid. Lloyd Morgan s canon 
which had seriously undercut the attributing of higher mental processes 
to animals had fairly well succeeded via behaviorism in excising them 
from man also. Therefore the return to a more balanced view, represented 
by the insight experiments, gave new hope to teachers and others who 
saw thinking and understanding returned to respectability. Insight was 
not a new discovery — it was a return to a conception laymen had never 
abandoned. Nobody uninfluenced by peculiar doctrines would ever have 
denied insight as a fact — yet it took Kohler to restore it as a fact in 
American psychology. It was, in some respects, time for a change, and 
Kohlers experiments dramatized release from the negatives of Thorn- 
,,dikian and Watsonian thinking. 

That the more enthusiastic reception for the new learning theories 
should have come first from the educators is not surprising. 2 There had 
already been a rift growing between Thorndike and the more progressive 
group within education, who, under Dewey’s leadership, had made much 
more than he of the capacity of the individual for setting and solving his 
own problems. The new insight doctrine fitted nicely their slogan of 
freeing intelligence for creative activity. 

Animal psychologists like Yerkes, who had never espoused behavior- 
ism, welapmed the new movement as a natural development. Yerkes him- 
self had done experiments on insightful learning independent of gestalt 
influences, 3 and the intelligence demonstrated by Kohler s apes did not 
surprise him. Curiously enough, insightful learning in subhuman ani- 
mals was less threatening to theorists than learning by understanding in 
man, chiefly because it was still the rather rare and unusual behavior 
among animals. Rats still learned mazes, it was thought, without in- 

2 It was an educator-psychologist, R. M. Ogden, who translated Koffka ( 1924 ). 

* E.g., Yerkes ( 1910? 
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sight. So the animal experimenters added insight experiments to their 
list and continued both old and new experiments. But if the insight 
doctrine were to be accepted in human learning, the field would be wide 
open for destroying all the familiar laws of learning as they applied to 
man. It is not surprising that those who were at the time concerned more 
largely with human learning, such as Thorndike, Robinson, and Guthrie, 
should all have been cool to the insight concept. 

The visible opposition between Kohler and Thorndike was over 
insight and trial and error, that is, over intelligent learning as con- 
trasted with blind fumbling. But the opposition between gestalt psy- 
chology and association psychology goes much deeper. In order to under- 
stand this opposition, it will be necessary to examine the gestalt views 
in greater detail. 

There are a number of variants within the gestalt movement and 
among those strongly influenced by gestalt conceptions. Kohler and 
Koffka were closest to Wertheimer, the official founder of the school. 
This chapter is devoted to their treatment of learning. Lewin, while 
originally from Berlin and definitely within the ranks, broke enough new 
ground so that a separate chapter is devoted to his position. All four 
of these men, originally German, eventually settled in America, where 
three of them (Koffka, Lewin, and Wertheimer) have since died. They 
are the leaders of what is historically gestalt psychology. 

Their views were picked up and considerably modified by Wheeler 
(1929) (1982) (1940) under the name of organismic psychology , but 
his position deviated too much from classical gestalt psychology for 
him to be counted within the core group. 4 

The fullest and most systematic treatment of the problems of learn- 
ing from the gestalt viewpoint is found in Koffka s Principles of gestalt 
psychology (1935). It was written after a period of acclimatization to 
America, and so meshes somewhat better than earlier writings with the 
concerns of American psychologists. Most of the direct references will be 
made to this source. 

4 Wheeler’s views were elaborated, with Perkins, in a widely used textbook on edu- 
cational psychology, Wheeler and Perkins {1932). For an account of the organismic 
position, see the first edition of this book, Hilgard (1948), pages 234-260. Because 
Wheeler’s theory is no longer influential, the chapter has been dropped from this 
edition, despite some provocative ideas contained in the theory. 
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THE LAWS OF ORGANIZATION 

• Gestalt psychology had its start and has achieved its greatest success 
in the field of perception. Its demonstrations of the role of background 
and organization upon phenomenally perceived processes are so convinc- 
ing that only an unusually stubborn opponent will discredit the achieve- 
ment The primary attack upon association theory was an attack on the 
"bundle hypothesis” sensation theory — the theory that a percept is made 
up of sensation-like elements, bound together by association. 

When the gestalt psychologists turned later to the problems of 
learning, the equipment brought to the study of learning was that which 
had succeeded in the field of perception, and the arguments previously 
used against the sensation were turned against the reflex. In spite of the 
attention which Kohler s ape experiments received, gestalt psychologists 
can be fairly said to have been only moderately interested in learning. 
This does not mean that their few experiments are without significance; it 
means only that they have considered the problems of learning secondary 
to the problems of perception. Perhaps in America the shoe is on the 
other foot, and in preoccupation with learning we have too long neglected 
the relationship between the two fields . 6 

The starting point for Koffka’s treatment of learning is the assump- 
tion that the laws of organization in perception are applicable to learn- 
ing. This applicability is enhanced because of the prominence in learn- 
ing given in his theory to the initial adjustment, to the discovery of the 
correct response in the first place. Since this discovery depends upon 
the structuring of the field as it is open to the observation of the learner, 
the ease or difficulty of the problem is largely a matter of perception. In 
some sense, Kohler’s apes were presented with perceptual problems; if 
they literally "saw” the situation correctly, they had "insight.” 

The application of the laws of organization to learning problems is 
done too casually by Koffka to be very convincing, but if he had been 
more systematic about it, the development of the argument might have 
gone along the following lines. There would be a guiding principle ( the ’ 
M w °f Fragnanz) and four laws of organization subordinate to it: the 
laws of similarity, proximity, closure, and good continuation. 

5 The point was made some years ago by Leeper ( 1935b). 
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The Law of Pragnanz 

The law of Pragnanz 6 suggests the direction of events. Psychological 
organization tends to move in one general direction rather than in other 
directions, always toward the state of Pragnanz , toward the “good” 
gestalt. The organization will be as “good” as prevailing conditions 
allow. A “good” gestalt has such properties as regularity, simplicity, 
stability, and so on. 

Because of the dynamic properties of “fields,” the conditions of 
equilibrium are necessarily important. In physics,* processes which 
terminate in stationary distributions are characterized by certain maxima 
and minima, as by a minimum of energy capable of doing work. This 
minimum for the whole sometimes requires a part to absorb a maximum 
of energy. The law of Pragnanz is a law of equilibrium like the princi- 
ples of the maximum or minimum in phvsics.'lt ought to correspond to 
them in reality, since it is the phenomenal representation of physiological 
processes which obey physical laws. In effect, however, it is used as an 
analogy /When organization moves toward a minimum, it is character- 
ized phenomenally by the simplicity of uniformity; when it moves toward 
a maximum, it is characterized by the simplicity of perfect articulation. 7 
“We might say, sacrificing a great deal of the precision of the physical 
proposition, that in psychological organization either as much or as 
little will happen as the prevailing conditions permit.” 8 

Learning situations are problematical. They therefore give rise to 
tensions and to disequilibria. Some such principle as the law of Prag- 
nanz becomes appropriate to them, although Koffka does not develop the 
point, except by way of the other laws, each of which, in its own way, 
is an illustration of the more general principle. 

1. The law of similarity . The law of similarity or of equality is the 
counterpart of association’s law of similarity. This and the other laws 
all derive from Wertheimer (1923). He used it as a principle determin- 
ing the formation of groups in perception, such as groups of lines or 
dots. Similar items (e.g., alike in form or color) or similar transitions 
(e.g., alike in the steps separating them) tend to form groups in per- 
ception. In a series of experiments with nonsense syllables, two-place 
numbers, and nonsense two-dimensional figures, Kohler (1941) showed 
quite conclusively that similar (homogeneous) pairs were much more 

6 The German word is inadequately translated as "pregnancy.” It has the meaning 
of “knapp, und dock vielsagend” ( compact but significant ) . 

* Koffka (1935), pages 171-174, 

8 Koffka (1935), page 108. 



228 THEORIES OF LEARNING 

readily learned than dissimilar (heterogeneous) ones. Homogeneous 
pairs illustrate the law of similarity. He denied that his results supported 
simple association theory, preferring instead to attribute the results to 
“interaction” producing a unitary trace rather than a “connection” be- 
tween similar items. 

The law of similarity was applied by Koffka to the selection of a 
memory trace by a process active at the time of recall. That trace will be 
selected by an excitatory process which possesses the same wholeness 
character. The meaning of trace selection will be considered later. The 
meaning is conveyed sufficiently by the process of recognition, where a 
face present now recalls the same one seen earlier and results in the 
feeling of familiarity. 

2. The law of proximity. Perceptual groups are favored according to 
the nearness of the parts. Thus if several parallel lines are spaced un- 
evenly on a page, those nearer together will tend to form groups against 
a background of empty space. Because whatever favors organization 
will also favor learning, retention, and recall, the law of proximity be- 
comes the gestalt equivalent of association by contiguity. Patterning 
through proximity holds also within audition, as in the grouping of 
successive clicks. Then the proximity is a temporal one. As it applies to 
memory, the law of proximity becomes also a law of recency. Old xvtf 
pressions are less well recognized and recalled than new ones becar^ 
the recent trace is nearer in time to the present active process. 9 

3. The law of closure. Closed areas are more stable than unclosed 
ones and therefore more readily form figures in perception. As applied 
to learning, closure is an alternative to the law of effect. The direction of 
behavior is toward an end-situation which brings closure with it. It is 
in this manner that rewards influence learning. 

So long as activity is incomplete, every new situation created by it is still 
to the animal a transitional situation; whereas when the animal has attained his 
goal, he has arrived at a situation which is to him an end-situation. 10 

In a problematic situation the whole is seen as incomplete and a 
tension is set up toward completion. This strain to complete is an aid to 
learning, and to achieve closure is satisfying. This is the meaning of 
the above quotation, and shows how closure is an alternative to effect. 

4. The law of good continuation. This is the last of Wertheimer s 
principles taken over by Koffka, although Wertheimer had several more. 

» Koffka (1935), page 464. 

10 Koffka (1924), page 102. From The growth of the mind by K. Koffka, Kegan 
Paul, Trench, Trubner & Co., Ltd., London, 1924. 
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Organization in perception tends to occur in such a manner that a 
straight line appears to continue as a straight line, a part circle as a circle, 
and so on, even though many other kinds of perceptual structuring would 
be possible. Closure and continuation are aspects of articulate organiza- 
tion. Oragnization applies to learning as well as to perception. 

Kohler (1941) demonstrated, for example, that the learning of 
paired figures was facilitated when the figures ‘‘fitted,” that is, when the 
cue and response items formed a regular pair. Pairs fitting less well were 
harder to learn. 

In each of the foregoing laws we have a principle from perception 
applied to learning. 

THE SPECIAL PROBLEMS OF LEARNING 

The general point of view of gestalt psychology is expressed in the 
statement that the laws of organization apply equally to perception and 
to learning. There are, however, special problems within learning to 
which Koffka devoted considerable discussion. Because of his anti- 
empiricist 11 position, he had to find some way of dealing with the evi- 
dent influence of earlier experiences on present performance. The prob- 
|pn is best approached via memory, in which the past is represented 
pmehow in the present. A second problem concerns the gradual trans- 
formation which takes place as skills of the trial-and-error sort are mas- 
tered. Finally, of course, there is the problem of restructuring the present 
field, as implied in insightful learning and in productive thinking. 

The Role of Past Experience: the Trace 

Because modification by and through experience is part of the very 
definition of learning, the gestalt attitude toward experience is important. 

The gestalt preference is distinctly for conceiving psychological proc- 
esses as the function of the present field, and the explanatory role of past ex- 
perience is denied in situation after situation in which to others it seems to 
be important. Examples include the perceptual constancies — whereby a 
man looks man-size at a distance, a red coat looks equally red in sunshine 
and shadow. The illusions of movement and perception of third dimension 
are also included. Koffka, in spite of a vigorous objection to empiricism, 
takes a moderate view toward the role of past experience in learning, 

11 Koffka was anti-empiricist, objecting to an explanation of present perception solely 
on the basis of past experience. This did not make him a nativist, however, for he 
believed that gestalt laws of organization provided a third solution to the problem of 
space perception. See Koffka ( 1935), pages 160-161; Luchins ( 1951), pages 83-86, 
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It will not be necessary to point out that an anti-empiristic attitude does not 
mean the denial of the enormous value of experience. Not that it makes use of ex- 
perience causes our objection to empiricism, but how it makes use of it. 12 

A favorite experiment repeatedly cited by gestalt psychologists in 
order to disprove the role of experience is that of Gottschaldt (1926). 
For example, if a picture of a letter E is presented 1000 times, and then 
a church window is exposed, are you any more likely to notice that 
some of the leaded lines in the windows could form a letter E than if 
E had been presented only once? We may doubt if a jury of association 
psychologists or anyone else would expect experience to tear down a 
percept into the thousand and one possible parts unless there were some 
kind of search involved. If you looked for the hidden part, and found 
it, the finding would be easier the next time. Gottschaldt accepts this 
conclusion, and had evidence that the results of previous discovery were 
evident in later tests. In the newspaper puzzles with faces hidden in the 
trees, once the face has been found it is more easily found again. Gott- 
schaldt's experiment reduces to the demonstration that camouflage hides 
familiar objects as effectively as it hides unfamiliar ones — provided we 
have no reason to be looking for the familiar objects. Gottschaldt's experi- 
ments are cited by gestalt psychologists as very damaging to association- 
ist or empiricist positions. Because of the difference in conception as to 
what is important, the experiment has not impressed association psycholo- 
gists as being a crucial refutation of their position. 13 

; By the very nature of the case, it is not as easy for the gestalt psy- 
chologist to dismiss the role of experience in memory as it is to dismiss 
its role in perception. Memory so obviously depends upon prior experi- 
ence that it would be foolhardy to deny it. Koffka is as puzzled as a non- 
gestalt psychologist over Wheeler's attempt to get rid of memory traces. 14 
Koffka believes some trace theory essential and proceeds to consider how 
the traces of past experiences can be reactivated by present processes. r 
The trace hypothesis is an involved one, and its full exposition re- 
quires over 100 pages of text. 15 The essential features of the theory are 
(!) a trace is assumed which persists from a prior experience, so that it 
represents the past in the present; (2) a present process is also posited, 
one which can select, reactivate, or in some manner communicate with 
the trace; and (3) there is a resulting new process of recall or recogni- 

12 Koffka (1935), page 639n. 

13 Moore (1930) BraJy (1933) and Henle (1942a) dispute Gottschald^s interpreta- 
tions. 

Wheeler ( 1932), pages 167-169. 

15 Koffka (1935), pages 423-528. 
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tion. The process and the trace are to be distinguished; they are localized 
in different parts of the brain. The trace system is organized according to 
the same laws applying in other fields, and the communication between 
process and trace follows these laws. 

The trace concept was further elaborated by Kohler ( 1938 ) and by 
Katona (1940)/'Katona made a distinction between individual traces, 
referring to specific items, and structural traces, derived from the whole- 
ness character of a process. The structural traces are said to be more 
adaptable and flexible, to be formed more quickly, and to persist longer 
than individual traces. 16 ^ 


By way of the doctrine of traces, the gestalt psychologist is able to 
represent a past event in the present. That is, of course, all that the as- 
sociation psychologist proposes to do/But the trace system, if it is a 
system organized under gestalt laws, must undergo changes according to 
the law of Pragnanz.JLi these changes are of a systematic sort it will be 
evidence against a theory of mere connections weakening in time or in- 
hibited by new learning. 

The experiments of Wulf (1922) and later experiments following up 
his suggestions will be reviewed among the illustrative experiments. The 
main point is that perceived figures are reproduced differently from the 
original model, and that the differences are systematic and progressive 
rather than random. The changes with successive reproduction cor- 
respond to the laws of organization, and move toward the "good” gestalt. 
A circle with a small break in it tends to be reproduced as a whole circle, 
and an asymmetrical figure tends to become more symmetrical in a draw- 
ing of it from memory. 'There are two chief tendencies noted by Wulf: 
leveling and sharpening . The leveling tendency is that already described, 
a tendency to move according to the intrinsic character of the figure into 
symmetry and uniform relations of parts. Sharpening consists in the ac- 
centuating of details which serve as the discriminatory features of the 
pattern. For example, a saw-toothed figure may be reproduced with 
deeper and more striking teeth./ Against the theory that memory leads 
to decay and fuzziness, the gestalt theory is that it leads to change but 
in the direction of greater clarity. A third tendency pointed out by Wulf 
is called normalizing . A figure which looks something like a familiar ob- 
ject tends on reproduction to be drawn more like such an object. All 
these changes (leveling, sharpening, and normalizing) are in the direction 
of a “good” gestalt. 

To the extent that these systematic changes occur in the trace, there 


16 Katona (1940), pages 194-195. 
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is a real addition which gestalt theory makes to other theories of memorial 
change. 

New Learning: the Formation of Traces 

What happens as new traces get formed? It is to be recalled that 
Koffka distinguishes between the process and the trace. The process is 
that which goes on because of the present stimulating situation; the trace 
is the result of earlier processes. 

1. Some processes are directly dependent upon stimuli. When such 
stimuli are presented a second time, the processes differ from those pres- 
ent the first time because the stimuli have been reacted to before. For 
example, the second exposure may be recognized as “familiar.” This dif- 
ference suffices to show that learning took place with the first exposure. 
His illustrations are limited to perceptual ones, and it is apparently 
perceptual responses to which Koffka refers when he speaks of processes 
directly dependent upon stimuli. 17 

2. Processes may undergo transformation within a single sustained 
presentation. For example, when a series of sentences about mathematics 
is finally “understood” as a demonstration or proof, such a transforma- 
tion has occurred. The insight experiments also illustrate such transforma- 
tions. 18 

3. Some processes are transformed by their consequences. This 
amounts to an acceptance by Koffka of the empirical “law of effect,” but 
the explanation differs from Thorndike’s. The transformation of process 
is at base the same as in the insight experiments, but it often occurs piece- 
meal as a consequence of the experimental arrangements. In the insight 
experiments all the data necessary for the transformation of process are 
present simultaneously, so that restructuring can take place at once. In 
the typical trial-and-error experiment, by contrast, the situation cannot 
be understood until the animal's activity has itself led to consequences — 
to food, to freedom, and so on. Once success is achieved, the process lead- 
ing to success is transformed. It has a new meaning, a new role in the 
goal-directed activity. 19 

The Effects of Repetition: the Consolidation of Trace 

Systems 

The aggregate trace system resulting from repetition is always being 
transformed. With each repetition the trace organization left from pre- 

17 Koffka (1935), pages 549-550. 

18 Koffka ( 1935), pages 55.5-556. 

18 Koffka (1935), page 552. 



233 


CLASSICAL GESTALT THEORY 

ceding processes interacts with the present process to create something 
new * According to the pr inciple of retro active inhibition (which Koffka 
accepts), preced ing individual traces are disrupted by the new learning. 
Repetition can still be beneficial, however, because the trace syst e m be- 
comes consohdatecTeven while individual traces are d estroyed. As the 
trace system becomes more tixed, it becomes ju-e- eminent over process 
and exerts more influence on future processes than such processes affect 
it. Such a trace system is said to become increasingly available; that is, it 
corresponds to what associationists think of as a habit system ready to 
function. A precaution is needed in the interpretation of availability, 
for conditions which make a trace more and more available for mere 
repetitions of one process may make it less available for other processes . 20 
This is one of the dangers of too much drill in the school subjects, because 
drill may have a narrowing or “blinding” influence . 21 

The treatment of the acquisition of skill by Koffka is sketchy and 
conjectural, for the problem has not been experimentally attacked by 
members of the gestalt group, But skill is made coherent with the process- 
trace theory through a line of thought somewhat as follows. The trace, as 
part of the field of a process, exerts an influence on the process in the 
direction of making it similar to the process which originally produced 
the trace . 22 This is close to Guthrie’s statement that we tend to do what 
we last did in the same situation. Highly perfected skills can be repeated 
after periods of disuse because the process communicates with a stable 
trace system to which it then conforms/ While the skill is being learned 
the trace is less stable. Through the interaction of trace and process, 
greater stability is achieved. This achievement of greater stability is 
what is meant by improvement in the skill/ Because the trace system, 
obeying dynamic laws, also undergoes stabilizing changes over a period 
of no practice, the greater improvement with distributed practice than 
with massed practice is explained. 

Restructuring the Present Field: Insight 

The contrast between trial and error and insight is a subject of some 
misunderstanding because there are empirical facts on the one hand and 
theories about these facts on the other. So far as empirical situations 
are concerned, there are experiments which demonstrate a maximum of 
fumbling, with gradual improvements and little understanding of how 
improvement takes place. These may be classified as experiments in 

Koffka (1935), page 547. 

2X Luchins (1942). 

22 Koffka (1935), page 553. 
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which learning is by trial and error, without prejudging the processes 
to be invoked in explaining the learning. There are also experiments 
in which the learner obviously perceives a relationship which leads to 
a problem solution, and the experiment may be classified as an insight 
experiment. And there are situations which fall between, where there is 
partial insight combined with rather blind trial and error. The empirical 
grading of situations does not mean that the interpretations have to be 
so graded, that is, that the trial-and-error behavior must be explained 
by a trial-and-error theory, insight by an insight theory, and mixed be- 
havior by appropriate mixtures of the theories. This impression is oc- 
casionally given by writers taking a middle-of-the-road position. The 
theorist, recognizing these empirical differences, tries to account for the 
differences on the basis of a unified set of principles. 


f 

Descriptive Characteristics of Insightful Learning 

We may distinguish between what goes on in an actual experiment 
investigating insightful learning and the distinctive criteria of insight, 
for what goes on is by no means limited to insight, and overlaps greatly 
with other forms of learning. 

1. A more intelligent organism is more likely to achieve insight, just 
as it is more likely to be successful at other forms of complex learning. 
Thus older children are more successful at insight problems than younger 
ones (e.g,, Richardson, 1932), and apes more successful than guinea 
pigs. 


2. An experienced organism is more likely to achieve insightful solu- 
tion than a less experienced one (e.g., Birch, 1945b). To some extent in- 
sight depends on past experience, as other forms of learning do. Thus a 
child cannot get insight into a mathematics problem stated symbolically 
unless he understands the conventional signs, even if the problem is other- 
wise suited to his capacity. 

The difference between association theories and gestalt theories lies 
in the implication of association theories that the possession of the 
necessary past experience somehow guarantees the solution. 23 While 
gestalt theorists would agree that past experience will facilitate solution, 
they object to explanations in terms of previous experience without 
taking organization into account. More is needed than the necessary 
amount of information. Just knowing enough words does not cause you 
to write a poem. The necessary experience alone does not solve the prob- 


23 The issue is discussed again (Chapter 12) in connection with the experiments and 
theories of Harlow ( 1949). 
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lem. In one of his early experiments, Maier ( 1930 ) provided his subjects 
with all the experience necessary for solving a problem, but only one of 
thirty-seven solved it. The past experience had to be used appropriately 
before solution would occur. 

3. Some experimental arrangements are more favorable than others 
for the elicitation of insightful solution. Organization is contributed both 
by processes inherent in the organism and by structural patterning in the 
environment, and, of course, basically through the interaction of organism 
and environment. Insight is possible only if the learning situation is so 
arranged that all necessary aspects are open to observation. If a needed 
tool is hidden, its use in solution is made unlikely, or at least more diffi- 
cult. In one form of the puzzle-box it is necessary for the rat to dig 
through a sawdust floor to discover a concealed tunnel winch permits 
exit. Because the entrance to the tunnel is concealed beneath a uniform 
bed of sawdust, insight is impossible, and the first solution necessarily 
occurs almost by chance — being aided only by the fact that sawdust- 
digging is within the rat’s habitual action pattern. The parts which need 
to be brought into relationship for solution are assembled more easily if 
they are simultaneously present in perception; for example, it is harder 
for an ape to learn to use a stick which lies on the side of the cage op- 
posite the food than one which lies on the same side as the food (Jack- 
son, 1942), 

Skilled teachers are well aware of differences between situations in 
which understanding is arrived at easily and those in which it is achieved 
with difficulty — even though the same ultimate steps are involved and 
the same end stage reached. In the favored arrangement the problem is 
so structured that significant features are perceived in proper relation- 
ship, and distracting or confusing features are subordinated. Some mathe- 
matics teachers make problem solution difficult to grasp because they go 
through derivations step by step without an overview of where the steps 
are leading or what the articulating principles are. They teach the neces- 
sary operations, but the final insight eludes the students because of the 
manner in which the proof is arranged, 

4. Trial-and-error behavior is present in the course of achieving 
insightful solution. In the presolution period the learner may make many 
false starts and be engaged in activity which can be characterized as 
trial and error. When insight will come (if it does come) is not pre- 
dictable. These two features (initial fumbling and lack of predictability) 
have been used by opponents of insight either to assimilate it to associa- 
tive learning because trial and error occurs, or to characterize it as 
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mystical, non-scientific, or accidental because the moment of solution 
cannot be predicted. 

The reply to those who find trial and error in insight experiments, 
and therefore wish to make insightful solution continuous with ordinary 
associative learning, is that fumbling in problem-solving is not mere trial 
and error. Even those who tend to favor trial-and-error interpretations 
have come to speak in terms of approximation and correction 34 or in 
other ways to indicate that the “try” is a real try and not just any old 
action in the behavior repertory. In the case of adult insight experiments, 
the "try” is often a plausible hypothesis which has to be rejected. A suc- 
cession of such hypotheses may be tried before the appropriate one is hit 
upon. The more intelligent reasoner may actually take longer to solve a 
given problem because he commands a greater variety of hypotheses to 
bring to its solution. There is a theoretical distinction which ought to be 
made between blind fumbling and intelligent searching. Merely varied 
behavior is one thing; behavior testing hypotheses is varied also, but ac- 
cording to a different type of organization. 

That random behavior and luck may further solution is illustrated by 
some behavior which I observed one summer while assisting Yerkes in 
an insight experiment with a young chimpanzee. The problem set the 
animal was to obtain a banana from a long hollow box, open at both 
ends. 25 The box, essentially a rectangular tube, was firmly fastened to 
the floor of a large cage. The banana was inserted through a trap door 
in the middle of the box under the watchful eye of the animal, then 
the trap door was securely padlocked. The chimpanzee, after a number 
of unsuccessful efforts to obtain the banana by direct attack — reaching 
in either end of the tube with hands and with feet, attempting to lift 
the tube from the floor — seemed to give up temporarily, or, as gestalt 
psychologists say, to “go out of the field.” This extraneous behavior took 
the form of playful cavorting. In this mood the animal incorporated into 
her play the hoe handle which was standing in the corner of the room, 
climbing it, and throwing it. Once the handle fell with its end near the 
open tunnel. The chimpanzee stopped her play, became calm, looked 
reflective, and, for the first time in her history used the pole as a tool to 
push the banana out of the far end of the tube. 

The lucky position of the hoe handle structured the situation per- 
ceptually to make solution easier. It brought the hoe handle in as a pos- 
sibility and gave direction to the problem-solving behavior. It did not 

2* Dodge (1931). 

25 The box is illustrated in Yerkes { 1948). 
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add to the chimpanzee s past experience, but it made it easier to as- 
semble the experiences appropriate to solution. Out of what was super- 
fluous activity there thus developed a “hint” as to the direction of solu- 
tion. An illustration of the way in which direct hints may aid solution 
is provided by Maiers (1930) experiment previously referred to. By 
giving a few “hints,” in addition to the necessary past experiences, solu- 
tions were obtained to the same problem by a much larger fraction of 
his subjects. 

The objection to insight that it is unpredictable and therefore out- 
side of science is lacking in force. The moment of insight is not the im- 
portant feature in any case. Other features, such as reproducibility of 
the behavior and applicability to new situations are more important. But 
even though the moment of insight for a given animal confronted with 
a given problem is not predictable, it is possible to arrange problems in 
an order of difficult}’ so that the degree of probabilitij that insight will 
occur is predictable. To assume that all predictions based on past occur- 
rences (empirical probabilities) imply associative learning 26 is to make 
of association a term so broad as to be meaningless. 

The four characteristics described (effects of capacity, prior experi- 
ence, experimental arrangements, and trial and error) are not distinctive 
for insight, although in the course of discussion certain suggestions were 
made of differences in interpretation when the solution is viewed in 
stimulus-response-association terms and when it is viewed as the achieve- 
ment of insight. 

J Distinctive Criteria of Insight 

Because of the overlap of what happens in trial and error and in in- 
sightful learning, can we specify a few defining criteria by which insight 
can be clearly differentiated from other kinds of problem solution? 

An early list of evidences of insight was provided by Yerkes (1927), 
based upon his analysis of photographic records of the problem-solving 
of a young gorilla, following upon earlier experiences with the orang- 
utan and chimpanzee. 

In acts which by us are performed with insight or understanding of rela- 
tions of means to ends, we are familiar with certain characteristics which are im- 
portant, if not differential. The following is a partial list of features of such 
behavior. It is presented here with the thought that the comparative study of 
behavior with insight, in different organisms, may reveal common characteristics. 

(1) Survey, inspection, or persistent examination of problematic situa- 

86 The assumption is made by Guthrie 11935), page 193, and by Guthrie and 
Horton (1946), page 42. 
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tion. (2) Hesitation, pause, attitude of concentrated attention. (3) Trial of more 
or less adequate mode of response. (4) In case initial mode of response proves 
inadequate, trial of some other mode of response, the transition from the one 
method to the other being sharp and often sudden. (5) Persistent or frequently 
recurrent attention to the objective or goal and motivation thereby. (6) Appear- 
ance of critical point at which the organism suddenly, directly, and definitely 
performs the required adaptive act. ( 7 ) Ready repetition of adaptive reasponse 
after once performed. (8) Notable ability to discover and attend to the essential 
aspect or relation in the probfematic situation and to neglect, relatively, varia- 
tions in non-essentials. 27 

On the basis of his observations of the young gorilla Congo, Yerkes 
went on to say that these observations confirmed his suspicion "that the 
conventional formula for habit-formation is incomplete, and the process 
of ‘tidal and error wholly inadequate as an account of anthropoid adap- 
tations.” 

The three types of evidence for insight most convincing to the ex- 
perimenter seem to me to be three: First , the interruption of movemen t 
f or a period , referred to by Yerkes as one of survey, inspection, attention, 
followed by the critical solution. This combines Yerkes* points (1), (2), 
(5) and {6). In the chimpanzee experiment described above, this stage 
was initiated following the "lucky” fall of the hoe handle near to the open- 
ing of the box. Second , the ready repetition of the solution after a single 
critical solution^This is Yerkes* point (7). The ape in the illustration 
given was returned to the experimental room on the following day. Every- 
thing was arranged as before. When the banana was locked into position 
and the chimpanzee released, there was a single flip of the lock (it might 
have been unfastened! ) and then the animal went directly for the hoe 
handle, carried it over a shoulder in a manner very different from the day 
before, and proceeded to use it appropriately as a tool. There was no 
byplay, no dropping it on the floor. This was convincing to the experi- 
menters as evidence that the previous day’s solution was accompanied 
by insight. (There might be some argument as to when the insight came, 
whether before or after the previous, day’s success, although the pause 
before solution of the first day strongly suggested that insight came be- 
fore the first solution . ) Third, solution with insight should be generalized 
to new situations that require mediation by common principles, or aware- 
ness of common relationships. While this is not specified in Yerkes’ list, 
it is implied in the multiple-choice experiment, a kind of insight experi- 
ment that Yerkes invented. Here insight is indicated by a perceived re- 
lationship such as “the middle one of the open doors” “the second from the 

27 Yerkes (1927), page 156. For other criteria of insight, see Pechstein and Brown 

/ * r\nr\ \ ° 
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right of the open doors.” In human subjects these generalizations are 
often put into words when the problem is solved. „ 

These three types of evidence are not entirely clear-cut and un- 
ambiguous, but they serve reasonably well when the whole context of 
the problem is considered. A rat pauses, too, before making a jump in 
the Lashley discrimination apparatus. How, one may ask, do you dis- 
tinguish this pause from the kind Yerkes speaks of? The answer lies 
partly in the greater freedom of Yerkes* animals to move about and to do 
other things. Interrupted behavior is more readily observed when the 
possibilities of varied behavior are rich. So too the repetition of a first 
solution can be made without insight, as shown by the Guthrie and Hor- 
ton cats. However, when the release pole was moved a few inches they 
were unable to use their prior pole-pushing habits, thus failing to live up 
to the third kind of evidence for insight. 28 But a chimpanzee or gorilla 
that has learned to obtain a banana with a stick will search for a stick 
when a banana is out of reach, or will improvise a stick substitute out of 
bundled straw. Having reacted to the more abstract relationship of 
stick-as-a-tool-to-obtain-banana, it is not disturbed by a slight change in 
the environment. 

Such applications of a perceived relationship to another situation in 
which it is applicable is the equivalent of transfer of training. The gestalt 
writers prefer to speak of it as transposition , on the pattern of a trans- 
posed melody. What is transferred is a relationship or a generalization, 
although the contents in the two situations may be entirely changed. 
v \ Stimulus-response psychologists recognize a corresponding application 
of old learning to new problems through stimulus generalization. While 
there is some overlap between the theories at all points (hesitation, 
repeated solution, transposed solution ) there is no denying that, descrip- 
tively, behavior characterized as insightful occurs. The residual differ- 
ences between the two theories lie more in their theoretical interpretations 
of what happens than in the descriptive overlap between the simpler 
forms of trial-and-error learning and the more convincing illustrations of 
insight. At the level of observation, insight is not the explanation of 
problem solution for a gestalt psychologist any more than for a stimulus- 
response psychologist. 

Kohler complains about the misinterpretations to which his book on 
apes led: 

When I once used this expression [insight] in a description of the intelligent 
behavior of apes, an unfortunate misunderstanding was, it seems, not entirely 

28 Guthrie and Horton { 1946), page 17. 
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prevented. Sometimes the animals were found to be capable of achievements 
which we had not expected to occur below the human level. It was then stated 
that such accomplishments clearly involved insight. Apparently, some readers 
interpreted this formulation as though it referred to a mysterious agent or 
faculty which was made responsible for the apes’ behavior. Actually, nothing of 
this sort was intended when I wrote my report. ... No question of inventions 
or other outstanding intellectual achievements is here involved, and, far from 
referring to a mental faculty, the concept is used in a strictly descriptive fashion. 29 

To the gestalt psychologist, insight exemplifies rather more clearly 
than other forms of learning the applicability of the laws of organization. 
It is these laws that explain insight, and it is the gestalt contention that 
the same laws explain other forms of learning. Only in that sense is in- 
sight for them the typical or characteristic kind of learning. 

Productive Thinking 

Wertheimer had lectured on thought processes for many years, but 
had published only a few fragmentary papers during his lifetime. He 
had, however, completed the manuscript of a small book just before his 
death. This was edited by his friends and appeared under the title Pro - 
ductive thinking (1945). In it a number of his experimental studies are 
summarized in his characteristic way, with penetrating qualitative analy- 
sis of simple situations serving to illustrate the differences between his 
approach and other approaches to which he is objecting. 

.c The two chief competing alternatives to adopting the gestalt ap- 
proach to thinking and problem-solving are said to be formal logic on 
the one hand and association theory on the other. Both of these alterna- 
tives are believed to be too limited to encompass what actually happens 
when an individual confronted with a problem finds a sensible solution. 

The distinction is made throughout between a blind solution in 
which the learner applies a formula, and a sensible solution in which 
the learner understands what he is doing in relation to the essential struc- 
ture of the situation. The blind solution is often an unsuccessful applica- 
tion of the formula to a situation not seen to be inappropriate. Experi- 
ments are cited, for example, in which school children are taught to 
find the area of a parallelogram by dropping lines from two corners 
perpendicular to the base, thus converting the figure to a rectangle, 
whose area can be found. Children who could do the examples per- 
fectly were baffled, however, when a parallelogram was presented in a 
new orientation, so that the “correct” steps of the procedure led to con- 
fusing results. They had learned the solution according to a blind pro- 

29 Kohler ( 1947 ), pages 341-342. 
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cedure. By contrast, the solution of a five-and-one-half-year-old child is 
reported. 

Given the parallelogram problem, after she had been shown briefly how to 
get at the area of the rectangle, she said, “I certainly don't know how to do that ” 
Then after a moment of silence: "‘This is no good here” pointing to the region at 
the left end; ""and no good here” pointing to the region at the right. 



FIGURE 30. Troublesome parts in child’s attempt to apply rectangle theory to 
parallelogram. After Wertheimer (1945), 


m l|‘ 


‘"It's troublesome, here and there” ^JPtingly she said: “I could make it 
right here . . . but . . . Suddenly she cried out, ""May I have a scissors? 
What is bad there is just what is needed here. It fits.” She took the scissors, cur 
vertically, and placed the left end at the right. 30 



FIGURE 31. Child’s solution of parallelogram problem with a scissors. After 
Wertheimer (1945), page 48. 


Another child, given a long parallelogram cut out of a piece of 
paper, remarked early that the whole middle was all right, but the ends — . 
She suddenly took the paper and made it into a ring. She saw that it was 
all right now, since it could be cut vertically anywhere and made into 
a rectangle. 

In cases # such as these the solutions appear in an orderly way, in 
line with the true ‘"structure” of the situation. It is this structural ap- 
proach which Wertheimer emphasizes. 

Children readily grasp such ""structural” solutions unless they are 
badlj^ taught in an atmosphere of blind repetitive drill. Given figures 
such as those on the left in Figure 32 and those on the right, they can 
easily sort out the unsolvable ones from the solvable ones. It is futile to 

30 Wertheimer ( 1945), page 48. 
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argue, says Wertheimer, that these distinctions are made on die basis of 
familiarity, as the associationist seems to believe. Children make the dis- 
tinctions because they know the essential nature of the solution. The 
structural features and requirements of the situation itself set up strains 
and stresses which lead in the direction of improving the situation, that 
is to say, to solving the problem. 


A-Figures 

<n 


B-Figures 






FIGURE 32. Applicability of solution of parallelogram problem to new figures. 
It is possible to change the A-figures sensibly so that they form rectangles. It is not 
possible to change the B-figures in this way. The ability of school children to solve 
the A-figures and to reject the B-figures is said to depend on something other than 
the familiarity of the figures. From Wertheimer ( 1945), page 20. 

The implications of Wertheimer's point of view for teaching are fairly 
clear. It is always preferable to proceed in a manner which favors dis- 
covery of the essential nature of the problematic situation, of the gaps 
which require filling in, so that, even at the cost of elegance or brevity, 
the proof is "organic” rather than "mechanical.” 31 

31 Duncker { 1945), page 45. 
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Experiments illustrative of gestalt theory 

OF LEARNING 

Three sets of experiments have been selected as representative of 
the topics stemming from the interests of Kohler, Koffka, and Wertheimer, 
These are: (1) experiments devised to test the hypothesis that memory 
traces change in a systematic manner with time, (2) experiments con- 
trasting organization with association in retroactive inhibition and related 
phenomena, and (3) experiments testing the relative roles of understand- 
ing and rote learning in memorization, retention, and transfer. The famil- 
iar insight experiments have already been referred to. 3 - 

Does the Memory Trace Undergo Systematic Change ? 

The position first advanced by Wulf (1922), that memory traces 
change according to the dynamic principles of organization, received 
early support from the experiments of Allport (1930) and Perkins (1932). 
Gibson (1929) found verbal factors a strong influence in producing what 
Wulf called normalizing, that is, each reproductions becoming more like 
a real object on successive attempts to reconstruct what was seen. 33 He 
therefore was somewhat critical of the theory of intrinsic factors within 
the memory trace producing the change. 

There is no doubt but that reproductions are often inaccurate and 
often exhibit the kinds of changes that Wulf suggested. But these changes 
are often found in the very first reproduction, so that the change may be- 
long more to perception than to memory. The crux of the matter lies in 
the finding of progressive changes with lapse of time. 

Most of the experiments that have shown changes demanded by the 
theory have used the method of successive reproductions, which intro- 
duces the disturbing factor that the second reproduction is affected by 
the first, the third by the second, and so on. An extreme illustration of 
changes which can be made from one reproduction to another is that 
shown in Figure 33, w T here each person made one reproduction based on 
perceiving what the person before him had drawn. Mere lapse of time 
between seeing die original and reproducing it would scarcely have re- 
sulted in, say, the tenth reproduction. 

In order to remedy the defect that experiments testing the theory 

3 - Review of additional experiments on insight may be found in Woodworth and 
Schlosberg (1954). A useful summary of the gestalt theory of insightful learning and 
its conditions may be found in Osgood { 1953), pages 605-610 

3S Gibson's results u ere extended and confirmed bv Carmichael, Hogan, and Walter 
(1932), Brown (1935). 
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REPRODUCTION 8 







REPRODUCTION 10 REPRODUCTION 15 REPRODUCTION 18 

FIGURE 33. Changes in figures within the method of serial reproduction. Each 
person views the reproduction by the person preceding him, and then passes his re- 
production to the next person. There are progressive changes and rbbov.t''""^ as in 
the series illustrated. From Bartlett ( 1932), pages 180, 181, by ( ami)! ■<!::< 

University Press, England, and The Macmillan Company, N.Y. 

used successive reproduction predominantly, Hanawalt (1937) had dif- 
ferent subjects reproduce the same figures after various intervals of time, 
with hut a single time interval for any one subject. Under these condi- 
tions, no progressive change was found. Such changes as occurred were 
commonly present on the first reproduction, and even might be present 
when the subject copied the figure present before him. 34 

34 Related experiments, with primarily negative results, are those of Irwin and 
Seidenfeld (i837), Irwm and Rovner (1937), Seidenfeld (1938), Goldmeier (1941), 
Hanawalt and Post ( 1942), Tennies (1942), Hanawalt (1952). 
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Hebh and Foord (1945), in a carefully controlled experiment, used 
the method of recognition rather than that of reproduction, asking the 
subject to select from a series of figures the one that had been originally 
presented.^There was only one retest for each figure learned by a sub- 
ject, but the retest was made after a lapse of time differing from one 
subject to another. The objects presented visually were a circle with a 
small portion of its circumference incomplete and an arrowhead figure. 
Presumably there would be a change either toward closure (leveling) 
or towaid emphasizing the broken part (sharpening). The circle may be 
used for illustration. Testing was done by selecting from a series of 
broken circles the one which was said to have been seen before. Although 
there was some variability, the trends were not in support of Wulfs 
theory, and there were no progressive changes evident with the passing 
of time. 

The conclusions to be reached from these experiments are certainly 
damaging to the gestalt interpretation of spontaneous changes in the 
memory trace. They do not deny the gestalt interpretation of changes in 
perception, or of progressive changes in successive reproduction, where 
successive perception is involved. 

Bartlett's ( 1932 ) experiments on repeating stories over long intervals 
are perhaps more instructive in some ways than the experiments repeating 
line drawings. They reveal rather striking tendencies toward structuring 
the story so that it “makes sense," and as the story grows older it also 
gets shorter and irrelevant details dropped out. This led Bartlett to 
speak of the productive nature of memory as contrasted with its reproduc- 
tive nature. However, the gestalt claim with respect to changes in the 
trace was based on the experiments of Wulf, and the refutations have 
been based on experiments using very similar materials. A common-sense 
use of Wulfs classification of changes into leveling, sharpening, and nor- 
malizing has been made by social psychologists, 3r * but these uses probably 
do not require confirmation of spontaneous changes in the trace. Krech 
and Crutchfield state, for example, that successive reproduction is char- 
acteristic of “real life," that is, “an occasional recall (without checking on 
the accuracy of the recall) of an event that has occurred in the past." 3G 

These experiments illustrate how a theory leads to a particular variety 
of experiment not suggested by other theories* Despite the negative re- 
sults, the experiments are useful. Experimentation is not intended merely 
to confirm theories; an equally important role is to refute, to correct, 

3r> E.g., Allport and Postman ( 1947). 

36 Krech and Crutchfield { 1948), page 127, footnote 1. 
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and to d elimi t the claims of a theory. The by-products have given some 
support for gestalt perceptual theory while almost invariably refuting the 
claims respecting the nature of spontaneous changes in the memory trace 
over time. 

The methodological possibilities have not yet been exhausted. Os- 
good 37 points out a failure to use figures whose perceptual dynamics 
are known on the basis of actual perceptual experiments (after-images, 
interpretation of the figure in dim light, etc.). Whether reproduction or 
recognition tests should be used cannot be given an easy answer. Using 
several figures at once causes a number of confusions in reproduction, 
not attributable to the spontaneous changes in a single trace. Possibly 
more subtle methods should be used in testing individual subjects re- 
peatedly; one suggestion made by Osgood is the use of hypnosis. 

Organization versus Associative Interference in Retention 

In a series of experiments published between 1933 and 1937, Kohler 
and his associates found results interpreted by them as interactions im- 
plying a field relation between a particular process and a particular 
trace. 38 

In her first study, von Restorff ( 1933) showed a relationship between 
perceptual laws and the recall of nonsense materials. She showed that 
in the recall of nonsense material, part of the difficulty lies in the homo- 
geneity of the material. If lists of paired associates are constructed so 
that one pair is of very heterogeneous material, this pair will be retained 
much better than the pairs of items representing materials more fre- 
quently repeated within the list. The interpretation is that the hetero- 
geneous items stand out like a figure on the ground, exactly as in percep- 
tion. This favorable structuring in perception turns out to be favorable 
also for recall. Muller ( 1937 ) went on to show that the recall of isolated 
or unique material was favored even when the unique items were dis- 
tributed over eight separately learned and recalled lists. The final task 
was the unexpected recall of whatever could be remembered from the 
eight lists. Six of the eight lists contained one kind of unique item, while 
two of the lists contained another kind of unique item. Recall favored 
the rarer unique items found only in two of the eight lists. 

In the study of Kohler and von Restorff (1935) an ingenious ar- 
rangement was used to test ^spontaneous recall” when there was a min i- 

. 37 Osgood { 1953), pages 590-591. 

33 von Restorff ( 1933 ), Kohler and von Restorff (1935), Bartel (1937), Muller 
{1937}. Aspects of these experiments were reviewed by Kohler ( 1938) ( 1940). 
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mum of induced if set” to recall. The laws of dynamics are best revealed 
when the results of process-trace interaction emerge naturally, without 
any special instruction to “search” for relationships. 

All subjects first worked on the following computational problem: 

x = 2l (91/7 4-6) 4- 14 

Most subjects went through the usual steps: 

First, 91/7= 13 (1) 

Therefore, 

x = 21 (13 4- 6) + 14 (2) 

= 21(19)4-14 (3) 

Performing the major multiplication: 

21 
X 19 
189 
21 

399 (4) 

Hence, 

x = 399 4* 14 = 413 (5) 

At this point the experimenter pointed out, somewhat casually, that 
the subjects might have noticed a somewhat easier way of multiplying 
21 X 19, using the formula (20 -f 1 ) (20 — 1) = 20 2 — 1 = 400 — 1 = 
399. The later retention test required recalling the pertinence of the 
principle that (a -f b) (a — b ) = a 2 — b 2 . 

After this exercise in computation, and the casual suggestion of an- 
other solution, half the subjects went on to do problems in arithmetical 
computation, while the other half worked on matchstick problems. Then 
both groups were again assigned a problem related to the one outlined 
above. The prediction was that the new problem would be detected as 
similar to the first one more frequently by those working on matchstieks 
than by those who had continued to work on arithmetic. That is, the 
computation problem would be more distinctive for those for whom 
it had been the only computation problem, and the new process would 
therefore be more likely to communicate with the memory traces from 
the first problem. 

The prediction was borne out. The “short-cut” method was used more 
frequently by those whose intervening activity was with matches than 
by those whose intervening activity was additional arithmetic. (See 
Table 4.) Upon questioning, the other subjects could easily remember 
having been told about the short cut. This bears out the importance of 
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arranging the experiment so as to test spontaneous recall if the dynamics 
of the process are to be exposed. Bartel (1937) continued with related 
experiments, with consonant results. 

Muller (1937) also studied a problem familiar in the history of 
retroactive-inhibition experiments, the problem of the point in the inter- 
val between learning and recall at which the\interpolated material pro- 
duces the greatest amount of interference^ Mullers findings supported 
the view that retroactive inhibition was at a maximum near the point of 
reproduction, suggesting that the action was really proactive rather than 
retroactive. 


TABLE 4. Spontaneous Recall as Affected by Homogeneous and Heterogeneous 
Intervening Activity (Kohler and von Restorff, 1935, page 79) 



Continued with 
Arithmetic 
( Homogeneous 
Activity ) 

Worked with 
Matchsticks 
(Heterogeneous 
Activity ) 

Total 

Subjects 

Recalled short cut 

9 

27 

36 

Failed to recall short cut 

25 

10 

35 

Total subjects 

34 

37 

71 


Chi square s= 15.3 
P less than .001 


All of these experiments show a resemblance in design to experiments 
done in the tradition of associative learning. What is unique about them, 
or about the theories used to explain them? 

First, the experiments commonly employ both homogeneous and 
heterogeneous materials, in order to point to principles of organization 
from perception as applying also to retention and recall. Added cogency 
is given to these arguments by later experiments of Werner (1947), 
where he showed that boundary phenomena which, in perception, pre- 
vent the confusion of forms, also operate in proactive inhibition. Material 
that was strongly "bounded” did not intrude as erroneous responses in 
a second list, and produced less interference with the retention of the 
second list. 

Second, the conception of process-trace interaction is said to differ 
from explanations according to associative connections. Kohler notes, 
for example, that in the usual retroactive-inhibition experiment traces 
are poorly distinguished and are densely crowded, so that they no longer 
function according to their intrinsic natures. Kohler points out that die 
experiments on spontaneous recall do not have these objections, for they 
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permit the traces to function more normally, with results as predicted 
from principles of organization.* 9 

Drill versus Understanding in Memorization and Retention 

In his book on Organizing and memorizing (1940), Katona reports 
a number of experiments inspired by Wertheimer, who contributed the 
foreword to the book. 

Katona attempts experimentally to define and characterize two 
types of processes leading to recall: rote memorizing and understanding. 
When a list of nonsense syllables is memorized, the learner is forced 
to use the former process, because there are no organizing principles 
which will permit understanding to help. On the other hand, there are 
many kinds of problems which illustrate principles; in such cases learn- 
ing by understanding will have advantages. Simple and ingenious ex- 
periments were designed in which it was possible to commit die same 
material to memory with or without understanding, and then to test 
the results on new learning. 

One experiment consisted in the teaching of simple match tricks 
of the kind illustrated in Figure 34. The problem is to move 3 lines and 
in so doing to have only 4 squares left. The possible solutions are shown 
in the figure. There is a simple principle involved in all solutions, which 
is that no side must be used for more than one square. (There are 16 
matches making the original 5 squares; because these 16 matches are 
now to make 4 squares it is evident that each side can be used but 
once.) A number of different tasks were used, all ringing changes on 
the same general pattern. 

In his Experiment A, three groups were used, a control group, a 
memorization group, and a group practiced on examples. No preliminary 
practice was given the control group. The memorization group was shown 
the first problem (that of Figure 34); with one of its solutions. Then 
this same problem was presented in rotated form and the same solution 
shown. The memorization group was thus shown essentially the same 
solution four times, with the problem very slightly rearranged geo- 
metrically. The group which had practiced on examples experienced six 
different transitions from one situation to another in their preliminary 
practice period, although no general principle was enunciated. Follow- 
ing this preliminary practice, each of the three groups was given four 
tasks to solve; these were all new to the control group, but one of the four 
39 Kohler (1940), page 155. 
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tasks was familiar to the memorization and examples group. There was 
a retest four weeks later, with a new control group. In this retest three 
new tasks were presented along with one of the originally practiced tasks. 
Successes on the practiced task were about alike for the two groups, in 
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FIGURE 34. Match-stick problems. The assigned problem is to make 4 squares in- 
stead of 5 by moving 3 matches (all matches to be used in the solution). Four solu- 
tions are shown for each of two tasks. From Katona (1940), page 120. 


spite of the fact that the practiced task for the examples group was only 
one of three tasks practiced, while the memorization group had spent all 
of its practice period on this one task. The advantage of the examples 
group was more pronounced on new tasks, as shown in Table 5 . 


TABLE 5. Comparison of Scores * on New Tasks of Groups Practiced in Different 
Ways (Katona, 1940, page 86) 



Immediate 

Test 

Retest After 

4 Weeks 

Control Group 

1.06 

2.20 

Memorization Group 

1.79 

2.84 

Group Practiced on Examples 

3.92 

6.24 


* Weighted scores, with possible score of 10. The differences between the examples 
group and the others are statistically significant. 
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Why was the group practiced on examples superior? Presumably 
because the more varied experience had produced some measure of un- 
derstanding. Verbal reports showed that this understanding was frag- 
mentary, taking such forms as “filling up holes increases the number of 
squares”; “spreading it out decreases the number of squares.” In any case, 
the method produced a more varied attack, and alternative solutions to 
the familiar problems were used in a way not true for the memorization 
group. Memorization sometimes tends to narrow rather than to increase 
the range of problem-solving. 

Katona concludes that learning with understanding not only im- 
proves retention of that which is learned, but better qualifies the learner 
to move forward to new learning. Thus understanding is important for 
transfer. Some of Katona’s conclusions have been criticized by Melton 40 
on statistical grounds, but Melton accepts as demonstrated the following 
points on transfer following learning by understanding: 

1, A group which has learned by rote memorization is little better 
than an unpracticed control group in the learning of new tasks, while a 
group which has learned with understanding learns new tasks much more 
readily. 

2. When once tasks have been learned with understanding, repeated 
tests with new tasks result in progressive improvement in performance. 
By contrast, if the repeated tests are done with practiced tasks, the effi- 
ciency of performance on new tasks is reduced. 

Katona concludes from all this that there is a real difference in what 
happens when meaningless memorization goes on and when learning 
takes place with understanding. He believes that the two kinds of learn- 
ing are genuine and should be distinguished. This does not mean that 
he believes that there are two fundamental prototypes of learning, cor- 
responding to the two types of arrangement under which learning takes 
place. According to him, the underlying factor is organization. Only when 
better organization fails do you get the extreme picture of rote learning, 
which is itself a special form of organization resorted to with compara- 
tively incoherent materials. The distinction in kind is not one of ease or of 
difficulty, for it is sometimes easier to learn by rote than to learn by under- 
standing. The advantages of learning by understanding are that mean- 
ingful learning is applicable to new situations and is more enduring. 

40 Melton (1941). For a reply see Katona (1942). Some features of Katonas ex- 
periments have been satisfactorily repeated and extended by Hiigard, Irvine, and 
Whipple ( 1953), Hiigard, Edgren, and Irvine ( 1954 ). 
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THEORIES OF LEARNING 


ESTIMATE OF GESTALT THEORY OF LEARNING 

Gestalt Theory and the Typical Problems of Learning 

The gestalt psychologists find a somewhat distorted emphasis in 
conventional treatments of learning, so that the problems typically empha- 
sized are not the most natural selection of problems from their stand- 
point In order to maintain the symmetry of comparative study of the 
different positions, however, the same list will be followed which was 
used to summarize the stimulus-response positions. 

1. Capacity. Because learning requires differentiation and restruc- 
turing of fields, the higher forms of learning depend very much upon 
natural capacities for reacting in these ways. Poor methods of instruction, 
however, may be responsible for some inability to face new situations, 
for a “blindness” which might be confused with stupidity. 

2. Practice. Changes go on within repetition, not as a result of repe- 
tition. Practically all psychologists now agree that this is so, but they 
differ with regard to the pertinent processes which go on within the 
repetitions. From the gestalt point of view, repetitions are successive 
exposures, bringing to light relationships to enter into restructurization. 
To Koffka, they also make possible the consolidation of trace .systems, 
which is as near as any gestalt psychologist comes to saying that re- 
sponses become fixed by repetition. 

3. Motivation. Goals represent end-situations, and as such modify 
learning through the principle of closure. The processes leading to the 
successes or failures get transformed by their consequences. The em- 
pirical law of effect is accepted, but Thorndikes interpretation of the 
blind action of effect is denied. 

4. Understanding. The perceiving of relationships, awareness of 
the relationships between parts and whole, of means to consequences, are 
emphasizedj^ the gestalt writers. Problems are to be solved sensibly, 
structurally, organically, rather than mechanically, stupidly, or by the 
running off of prior habits. 

5. Transfer. The gestalt concept most like that of transfer is trans- 
position. A pattern of dynamic relationships discovered or understood in 
one situation may be applicable to another. This is in some respects like 
Judds generalization theory of transfer. 41 There is something in com- 
mon between the earlier learning and the situation in which transfer is 
found, but what exists in common is not identical piecemeal elements 

41 Judd (1908). 
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but common patterns, configurations, or relationships. One of the ad- 
vantages of learning by understanding rather than by rote process is 
that understanding is transposable to wider ranges of situations, and less 
often leads to erroneous applications of old learning. 

6. Forgetting . Koffka relates forgetting to course of changes in the 
trace. Traces may disappear either through gradual decay (a possibility 
hard to prove or disprove), through destruction because of being part of 
a chaotic, ill-structured field, or through assimilation to new traces or 
processes. The last possibility is familiar as a form of theory of retro- 
active inhibition. Traces which continue to exist may at a given moment 
be unavailable. While little is known about this, Koffka believes it must 
have something to do with ego organization. 42 Finally, there are instances 
of forgetting in which a process fails to communicate with an otherwise 
available trace. The forgetting of an intention u would presumably clas- 
sify here. This is also an ego problem. 

In addition to such forgetting, there are the dynamic changes which 
take place in recall, so that what is reproduced is not earlier learning 
with some parts missing, but a modified trace which is productive as well 
as reproductive. 

General Aspects of Gestalt Theory 

In discussing gestalt psychology solely as a theory of learning, some 
of its more general features have been sidestepped, especially its philo- 
sophical orientation and its relation to biology. 

The objection to association theory of learning is part of the holistic 
emphasis within the general theory, and coherent wuth the opposition to 
"atomistic” explanations according to connections between parts. The ob- 
jection to sensations as elements of perception (“the bundle hypothesis”) 
is carried over in the objection to stimulus-response connections as ele- 
ments of habits. 

The phenomenological standpoint, often stated by gestalt psycholo- 
gists as opposed to the prevailing positivistic position, is not easy to char- 
acterize satisfactorily in limited space. Phenomenal observation is more 
“subjective” than behaviorism, but less sophisticated and specially trained 
than the introspection recommended by Titchener. The recommended 
variety of observation is natural and childlike, appreciative rather than 
analytical. For example, Kohler's later accounts of insight depend upon 
such a phenomenological description of events. Insight in this context in- 

42 Koffka (1935), pages 525-527. 

43 Birenbaum (1930). 
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eludes such everyday experiences as enjoying a cool drink. We refer the 
pleasures we feel to the taste of the drink and to our thirst, through a 
natural “requiredness” of the relationships. We do not attribute our pleas- 
ure to accidental accompaniments, ‘not, for instance, to the spider on 
the wall, nor to the size of the chair before me, nor to the thousands of 
other things in my environment.” 44 These everyday experiences of the 
determination of events, here called insight, are said to refute the theory 
of chance contiguous associations. 45 

The interpretation of gestalt psychology as a field theory rests largely 
on the evidence assembled by Kohler in his Physischen Gestalten (1920), 
showing the relationship between gestalt laws of organization and well- 
established principles in physics and biology. The biological emphasis 
within gestalt has been carried forward by Gelb and Goldstein. 46 

Kohler has all along held isomorphism to be central to the under- 
standing of gestalt psychology. This is the principle that the underlying 
brain fields correspond in their dynamic aspects to phenomenal experi- 
ence. He believes psychophysical isomorphism follows from the principles 
of evolution, so that mental events are not alien to physical events, but 
are rooted in the laws of nature: 

Some fundamental properties of nature rather than any special arrange- 
ments in the organism are, I believe, the counterparts of essential mental facts. 47 

We shall return later (Chapter 12) to Kohlers more specific hy- 
potheses about brain fields, and his experiments related to them. 

The Interrelationship Between Perception and Learning 

Any estimate of the classical gestalt position would be incomplete 
which did not appraise the success with which the basic thesis has been 
defended that the dynamic laws of perception and of learning are alike. 
For the most part, these conjectures have been programmatic rather than 
worked out in convincing experiments. The point of view has been help- 
ful to the extent that it has brought emphasis upon organization, mean- 

44 Kohler ( 1947), page 346. 

45 For further discussion of the phc-orrenolocpcal position as interpreted by gestalt 
psychologists, see MacLeod (1947) ^ 1951;. The phenomenology of Snygg and Combs 
( 1949) does not belong within the classical gestalt tradition. 

46 The collaboration between Gelb and Goldstein began in the study of brain-injured 
soldiers of World War I, Gelb and Goldstein (1918). Goldstein’s matured views are 
presented in book form in Goldstein (1939) (1940) (1948). Wheelers organismic 
psychology picked up and extended the biological emphasis, and assimilated embryo- 
logical studies, but it must be considered an offshoot of gestalt psychology rather than 
part of the central core of development. So, too, many gestalt-affiliations can be found 
in Werner { 1948), but he is not, strictly speaking, one of the gestalt group. 

47 Kohler (1938), page 400. & & 
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ingfulness, and understanding, and has called attention to the importance 
of the structure of the problematic situation. The laws of similarity (or 
equality), of proximity, of closure, and of good continuation, are not very 
convincing as laws of learning, and the attempts made by Koffka to sum- 
marize his point of view on learning in terms of them is not very successful. 
As we shall see in later chapters, the reintroduction of cognitive fea- 
tures into learning has been important, whatever one’s judgment of the 
importance of the laws of organization may be. 

Insight as an Alternative to Trial and Error 

It is implied in gestalt theory that a learner acts as intelligently as 
he can under the circumstances which confront him, so that insightful 
solution of problems is the typical solution, if the problem is not too diffi- 
cult and the essentials are open to inspection. Fumbling and trial and 
error are resorted to only when the problem is too difficult, either 
intrinsically, or because of the way in which it is presented to the learner. 
This reverses the associationist position that trial and error is the typical 
method of attack and that reasoning is essentially “mental” trial and er- 
ror. The empirical facts of insight are as satisfactory as the empirical 
facts of trial and error. Learning theorists are not yet in agreement on 
one of the three possibilities: ( 1 ) that all learning is of one kind, basically 
like trial and error, from which insightful learning can be derived; (2) 
that all learning is basically of one kind, like insightful learning, with 
trial and error a derivative form; or (3) that more than one kind of 
learning occurs, of which trial and error and insight are two illustrative 
examples. A strong case can be made for the gestalt point of view that 
blind learning is not the prototype of all learning. But this is one of the 
unresolved issues, to face us in succeeding chapters of this book. 

In any case, the gestalt psychologists have sharpened the lines of 
cleavage in thinking about problems of learning, and by questioning most 
of what was conventionally accepted they have been of real service, re- 
gardless of whatever verdicts or compromises the future may produce. 


SUPPLEMENTARY READINGS 


BOOKS 

Ellis, W. D. (1938} A source book of gestalt psychology . 
Hartmann, G. W. (1935) Gestalt psychology, 

Koffka, K. (1924) Growth of the mind 
Koffka, K. (1935) Principles of gestalt psychology. 
Kohler, W. (1925) The mentality of apes. 



256 THEORIES OF LEARNING 

Kohler, W. (1929) (1947) Gestalt psychology. 

Kohler, W. (1940) Dynamics in psychology. 

Wertheimer, M. (1945) Productive thinking. 

SHORTER INTRODUCTIONS 

Hartmann, G. W. ( 1942) The field theory of learning and its educational conse- 
quences. Chapter 5 in The psychology of learning. Natl. Soc. Stud. Educ., 
41st Yearbook, Part II, 165-214. 

Heidbreder, E. (1933) Seven Psychologies , 328—375. 

Koffka, K. (1925) Mental development. In C. Murchison, ed. Psychologies of 
1925 , 130-143. 

Kohler, W. (1930) Some tasks of gestalt psychology. In C. Murchison, ed. 
Psychologies of 1930 , 143-160. 

Woodworth, R. S. (1948) Contemporary schools of psychology , 120-151. 

critical reviews 

Boring, E. G. (1930) The gestalt psychology and the gestalt movement. Amer. 
J. Psychol, 42, 308-315. 

Pratt, C. C. (1936) Review of Koffka’s Principles of gestalt psychology. Amer. 
J . Psychol. , 48, 527-531, 

Robinson, E. S. ( 1930) Review of Kohler’s Gestalt psychology. J. genet. Psychol . , 
37, 431-450. 

Spence, K. W. (1941) Review of Kohlers Dynamics in psychology. Psychol. 
Bull., 38, 886-889. 

Vernon, P. E. ( 1935-1936) Review of Koffka’s Principles of gestalt psychology . 
Character and Pers., 4, 92-94. 

representative experiments 

Adamson, R. E. and Taylor, D. W. (1954) Functional fixedness as related to 
elapsed time and set, J. exp. Psychol, 47, 122-126. 

Birch, H. G. (1945) The role of motivational factors in insightful problem- 
solving. J. comp. Psychol, 38, 295-317. 

Birch, H. G. (1945) The relation of previous experience to insightful problem- 
solving. J. comp. Psychol, 38, 367-383. 

Duncker, K. (1945) On problem-solving. Psychol. Monogr., 58, No. 270, ix, 
113 pp. 

Durkin, H. E. (1937) Trial-and-error, gradual analysis, and sudden reorganiza- 
tion: An experimental study of problem solving. Arch. Psychol., N.Y., 
No. 210. 

Gottschaldt, K. (1926) t)ber den Einfluss der Erfahrung auf die Wahrneh- 
mung von Figuren, I. Psychol Forsch., 8, 261-317. Translated and con- 
densed as “Gestalt factors and repetition” in Ellis (1938), pages 109-122. 

Hanawalt, N. G. (1952) The method of comparison applied to the problem of 
memory change. /. exp. Psychol, 43, 37-42. 

K5hler, W. (1918) Nachweis einfacher Strukturfunktionen beim Schimpansen 
und beim Haushuhn. Abb. d. kbnigl Preuss. Ak. d. Wissen, Phys. Math. 
Klasse, Nr. 2, 1-101, (Translated and condensed as “Simple structural 
functions in the chimpanzee and in the chicken” in Ellis (1938), pages 
217-227.) 

Kohler, W., and Restorff, H. von (1935) Analyse von Vorgangen im Spuren- 
feld. Psychol Forsch 21, 56-112. * 



CLASSICAL GESTALT THEORY 257 

Newman, E. B. (1939) F~77"t*:~r of meaningful material during sleep and wak- 
ing. Amer . J. Psycho 1 , V, /L 

Perkins, F. T. (1932) Symmetry in visual recall. Amer. J. Psychol. , 44, 473-490, 
Wertheimer, M. (1925) t)ber Schlussprozesse im produktiven Denken. Drei 
Ahhandlungen zur Gestalttheorie. Berlin: Eilangen, 164-184. Translated 
and condensed as “The syllogism and productive thinking" in Ellis (1938), 
274-282. 

Wulf, F. (1922) t)her die Veranderung von Vorstellungen (Gedachtnis und 
Gestalt). Psychol. Forsch., 1 , 333-373. (Translated and condensed as 
“Tendencies in figural variation" in Ellis (1938), pages 136-148.) 

FINAL NOTE 

No very satisfactoiy recent account can be found on the gestalt theory of 
learning. The following small book on gestalt psychology has gone through two 
German editions and has been very widely translated. Its 29 chapters, many of 
them of only two or three pages, give the range of applicability of gestalt think- 
ing, but the book is not very satisfactory on critical issues. The word “insight,” 
for example, is not used as a technical term, and does not appear in the index. 

Katz, D. (1950) Gestalt psychology , its nature and significance. 

The following thoughtful summary treats the major issues in terms of the 
contrast between cognition and stimulus-response theories: 

Scheeber, M. (1954) Cognitive theory. In G. Lindzey, ed. Handbook of social 
psychology , 91-137. 



Chapter 8 

LEWIN’S FIELD THEORY 


ALTHOUGH a member of the Berlin gestalt group, Kurt Lewin (1890- 
.1X1947 ) early began to break new ground, especially in studies of motiva- 
tion. After a number of productive years in Germany, he spent his last years 
in the United States, teaching from 1932 successively at Stanford, Cornell, 
Iowa, and the Massachusetts Institute of Technology, with a term also at 
Harvard. He became widely known and highly influential, and because of 
a warmth and infectious quality of personality developed a devoted follow- 
ing. Although his theory is not, strictly speaking, a psychology of learning, 
much of it is relevant. His doctoral dissertation, in 1917, was critical of the 
then current association theory of memorization and retention. The theory 
that he developed as a corrective was called by various names as it pro- 
gressed, such as topological psychology, but it is now usually referred to by 
its friends and adherents as a field-theoretical position. It is satisfactory to 
call it simply a field theory, without implying that it resembles in detail the 
field theories of modern physics. 

THE BEGINNINGS OF LEWIN’S THEORY 
The Lewin-Ach Controversy 

Ach belonged to the Wurzburg school which had supplemented con- 
ventional association with sets, determining tendencies, and the like, thus 
adding something more dynamic to the standard theory of strengthening 
associations through repetition. He designed a series of experiments in 
which a habit of reproducing nonsense syllables was set up through re- 
peated exposure, and then a determining tendency was introduced which 
might either facilitate or hinder .the tendency created by practice. Thus 
“will” was set against “habit,” and a possible quantitative measure of will 
proposed. 1 

*Ach (1910). 
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There were three arrangements of nonsense syllables in the original 
learning experiments of Ach. One was the usual or normal arrangement of 
heterogeneous pairs, such as bol-pid; another arrangement called for a 
rhyming syllable, as rik-tik ; the third called for inversion, as kep-pek. The 
subject learned one series of eight pairs of each of these kinds. The stimulus 
syllable of each pair was later used in test situations. In the test series, 
instructions were to reproduce what had been practiced, or to rhyme, or to 
invert. Thus the determining tendency could either sum with or contradict 
the associative reproductive tendency, or the reproductive factor could be 
present alone. Strength of reaction was inferred from latency of response. 
Ach found some results coherent with his theory, that the shortest reaction 
times were in the cases in which the associative (reproductive) tendency 
coincided with the determining tendency superimposed by instructions. 
His other results were somewhat irregular, but were interpreted by him as 
giving general support to his position. 

Lewin’s objection to Ach’s work was not that he introduced the deter- 
mining tendency, which Lewin essentially accepted, but that he added it 
to the conventional association theory without seeing that it was a foreign 
intrusion and led logically to the abandonment of the simple association 
theory. Lewin argued that the conflict which appeared in the experiment 
— and, indeed, conflict was convincingly demonstrated — was not between 
association and determining tendency, but between two determining tend- 
encies. It was his belief that there is no “force” within mere association to 
lead to reproduction, that reproduction itself must be motivated and im- 
plies a set to reproduce. 

Lewin’s experiments 2 were much like Ach’s, but a few modifications 
greatly changed the resulting interpretations. First a series of eight pairs 
of nonsense syllables was learned in which the second syllable rhymed 
with the first, dak-tak ged-ked. Then a list of eight pairs was learned in 
which the response syllable was the stimulus syllable spelled backward. 
Note that the only change in response syllables was in the consonants, the 
vowels being alike in both the rhyming and the reversed pairs. 

The first test series consisted in instructing the subjects to respond to a 
set of syllables by changing the vowel. Syllables from the previously prac- 
ticed series were inserted among unfamiliar syllables. If Aeh’s theory were 
sound, there should have been some conflict between “habit” and “deter- 
mining tendency,” but none was found. There were no wrong reactions, 
and the latent times for control and critical syllables were alike. 

A second test, however, produced conflict and retardation, such as 

2 Lewin (1917) (1922). 
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Ach found. When the subjects were told to rhyme a series including both 
syllables previously rhymed and syllables previously reversed, mistakes 
were made on the syllables previously reversed. Similarly, with instructions 
to reverse, errors were made in response to syllables previously rhymed 
when they were presented in the midst of syllables previously reversed. 
The interpretation is that the instruction to rhyme to syllables previously 
learned as rhymes was acted upon as though it had been an instruction to 
reproduce. When, then, a previously reversed syllable appeared, the faulty 
self-instruction to reproduce conflicted with the correct instruction to 
rhyme, and interference was shown. 

The occurrence of conflict in one case and not in the other, although 
the prior habits fixed by association were equally strong before both tests, 
confirmed Lewin in his belief that the existence of an association ( or, in his 
later terms, a cognitive structure) does not provide the “motor” for mental 
activity. There is always a tension system necessary for activity, including 
the activity of reproducing previously learned nonsense syllables. 

Lewin s contention, while it remains cogent, has lost some of its force 
as interest has shifted from the artificially motivated habit of rote learning 
to habits more intimately tied up to need and tension systems. For example, 
a conditioned avoidance response, strengthened through repeated rein- 
forcement, has a certain urgency about it, so that it breaks through instruc- 
tions to inhibit response. It is possible to treat the conflict between “habit” 
and “determining tendency” in this case much as Ach did, and to find 
results which seem to support Ach’s position, 3 The findings do not contra- 
dict Lewin s position, however, because in the case of conditioned avoid- 
ance responses there is a “motor” in the threat of punishment which is set 
against the verbal instructions not to respond. This “habit” is more than 
mere contiguous association leading to reproduction of associated items. 

This early recognition of the dynamic organization of reproductive 
tendencies is reflected in later experiments on the relationship between 
tension and retention, as shown in memory for finished and unfinished 
tasks, 4 and the tendency to resume unfinished tasks when the opportunity 
arises. 5 

Psychological Tension Systems 

Much that foreshadowed his later systematic writings appeared in an 
important paper by Lewin in 1926, shortly after Kohlers and Koffka’s 

3 Hilgard ( 1938 ) . 

* Zeigarnik (1927). 

6 Ovsiankina (1928). 
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views were becoming known in America. Lewin’s position was first intro- 
duced to American audiences by J. F. Brown ( 1929), three years after this 
paper appeared. 

For one thing, Lewin set himself squarely in favor of a psychological 
analysis of the actual situation, and against what he called accomplishment 
concepts. The practice curve for typewriting, scoring typing according to 
words typed per minute or some other convenient unit, rises, shows a 
plateau, reaches a limit, and so on — as though all that happened could be 
described as ‘"typewriting.” A more psychological analysis would show 
that what the beginner is doing and what the skilled typist is doing are 
entirely different: the beginners process is one of searching, but the 
searching process drops out as skill is mastered. This objection to accom- 
plishment is to be compared with Guthrie’s objections expressed in much 
the same way, several years later. 0 The difference is that Guthrie wishes 
to restrict his measurements to movements, which Lewin would class with 
accomplishments as lacking in sufficient psychological meaning. 

The typical non-gestalt conception of psychological causation, says 
Lewin at this time, is adhesion . One thing is attached to another so that 
revival of the first brings forth the second. Gestalt psychology is sometimes 
misunderstood as correcting association by accepting the adhesion prin- 
ciple but permitting it to apply only to parts of wholes. This notion is flatly 
rejected. No mere coupling principle can provide the energy for psychical 
activity. Psychological behavior depends upon energy related to psycho- 
logical tension systems. 

Perception of the world of meaningful things and events does not pro- 
vide the energy for activity, although activity aroused under need-tension 
systems may be steered by perception. This is not unlike Skinner s later 
notion of the role of the discriminated stimulus in operant behavior: the 
stimulus does not elicit the behavior, but it does set the occasion for it. The 
sequence of events from perception to satiation is as follows. The percep- 
tion of an object or event may give rise to a psychological tension (e.g., a 
desire), or it may communicate with a state of tension already existing in 
such a way that this tension system thereupon assumes control over motor 
behavior. The aroused “valences” ( attractions and repulsions of perceived 
goal objects) act as environmental forces steering subsequent behavior. 
This behavior then leads to satiation or to the resolution of tension so 
that a state of equilibrium is approached. 

At this early date ( 1926) Lewin went to some pains to point out the 
ridiculousness for scientific purposes of saying that “everything depends 

6 Guthrie ( 1935), page 163. 
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upon everything else/’ While as a gestalt psychologist he was concerned 
with wholes, he recognized as the most important problem the isolation of 
the "specific psychical units, personality spheres, and behavior wholes” in 
which one’s activities, emotions, intentions, wishes, and hopes are em- 
bedded. Whatever psychical unity there may be in the “ego” system, it is 
a 4 weak” gestalt, and we are dealing in the personality with a great number 
of "strong” gestalts which in part are in communication with each other, in 
part disclose no genuine unity at all. In adults the possibility of organized 
behavior depends upon a relatively complete segregation of a number of 
different tension systems. 

He devoted a number of pages to an analysis of "intentions,” how they 
are fulfilled, why they are forgotten. He arrives at the conclusion that they 
are quasi-needs, because their satisfaction is like the satisfaction of other 
needs, not dependent upon particular occasions or particular actions. Sub- 
stitute satisfactions may be found for intentions as for needs. 

This paper introduced many of the concepts to be developed in the 
later attempts to formalize and metricize the system. There are the needs 
and quasi-needs; the tension systems; perception (cognitive structure) as 
steering, but not as providing energy; valences; boundaries; a differenti- 
ated and complexly structured ego system. It is a natural development to 
the later more formal system. 


TOPOLOGICAL, VECTOR, AND HODOLOGICAL 
PSYCHOLOGY 

The more abstract and formal characteristics of Lewins system were 
detailed in a book and a monograph, Frinciples of topological psychology 
(1936), and The conceptual representation and measurement of psycho- 
logical forces (1938). He later stated the relationship of his viewpoint to 
problems of learning. 7 

The words “topological” and “vector” refer to types of geometry bor- 
rowed because they appear to provide the mathematics appropriate to the 
structure of psychological situations. Any mathematics, like any language 
or any system of logic, may be used in science if it happens to fit. Plane 
geometry and trigonometry do for surveying, and spherical geometry is 
useful for navigation, but Einstein had to turn to a geometry of four dimen- 
sions to handle relativity problems. The problem of selecting appropriate 
mathematical tools is familiar in choosing the proper statistical formulas 
7 Lewin (1942). 
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for a given set of data. In looking for a “fit” to his conception, Lewin hit 
upon topology and upon vectors. 

Topology is a non-metrical geometry of spaces, in which concepts 
such as "inside” and "outside” and "boundary” are used. Lewin believes 
that many psychological situations are structured in that way. Sociologists 
have long talked of "in-groups” and "out-groups.” It is such a usage which 
Lewin is trying to formalize in topological concepts. 

Vectors are borrowed from the mathematical system used in me- 
chanics to describe the resolution of forces. Unlike topology, vectors are 
metrical. A vector is usually represented by an arrow, the length represent- 
ing its force, the direction representing the line of application. The point of 
application may also be shown as a matter of convenience, but it does not 
belong as a property of the vector. Because much dynamic psychology can 
be described in terms of conflict and the resolution of conflict, the geometry 
of vectors finds appropriate application. 

His desire for a "qualitative” geometry having some of the character- 
istics of topology with a "quantitative” geometry of vectors led Lewin to 
suggest that a new geometry was needed to fit the problems of psychology. 
He specified some of its characteristics, and named it the geometry of 
hodological space . The root is from the Greek word hodos } meaning "path.” 
Hence hodology is to be a geometry of paths, somewhat as topology is a 
geometry of spaces. 

What Lewin intended by borrowing from topological and vector 
geometries is clear. He was searching for a mathematics which would be 
appropriate to psychological structure as he understood it. He has been 
attacked for using these mathematics incorrectly and for drawing im- 
proper analogies from them. 8 His system is no doubt vulnerable in some of 
these respects, but the relevance of his theories for the purposes of the 
present discussion does not rest upon the validity of his quantitative rep- 
resentations. 

The Psychological Field: Life Space 

The selection of geometry as an appropriate mathematics is natural 
because the psychological field is thought of as a space in which the person 
moves. This space is psychological, not physical; it is represented mathe- 
matically by a spatial diagram, but that does not mean that it exists in 
physical space. 

The conception of life space is a plausible one. It signifies that two 
8 London (1944). 
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people walking down the same street are going different places, and the 
worlds in which they are walking are to them different worlds. Or as I sit 
in reverie and make plans for tomorrow, I move in a world very different 
from that in which I sit. My life space is the space in which I live psycho- 
logically, as seen from my own viewpoint. It includes each and every ob- 
ject, person, idea, that I have anything to do with. It corresponds in many 
ways to the world about me, to the world of things, people, and ideas, 
being my selection from these, but it becomes my world always in edited 
and distorted forms. It is possible to distinguish between me and my life 
space, for I move about in my psychological space as others do in theirs. 
We interact in the real world, all the while producing changes in our re- 
spective psychological worlds. Because of correspondences between the 
psychological world and the real world, Lewin spoke of life space as quasi- 
physical, quasi-social, and quasi-conceptual. Life space is conditioned 
and influenced by the physical, social, and conceptual environment but 
cannot be identified with that environment. 

Life space may be inferred; it does not depend upon subjective report. 
That is, although life space is psychological, it is not strictly a private affair 
available only through introspection. A person is not always able to intro- 
spect about the forces acting in his life space at a given moment. Life space 
is a construct, like other scientific constructs, designed to account for the 
psychological situation at a given moment. 



FIGURE 35. Life-space, The situation represented is that of a boy who wants to 
become a physician. The person (P) is shown separated from the goal region ( -f G) 
by a number of regions with boundaries between them. The regions are college en- 
trance (ce), college (c), medical school (ra), internship (i), establishing a practice 
(pr). While the locomotion through these regions may take place in the future, they 
are in the present life-space as obstacles to the attainment of a goal which is now 
in view. By permission from Principles of topological psychology , by K. Lewin, Copy- 
right 1936 by McGraw-Hill Book Co., Inc. 

Locomotion in life space is delineated by a geometrical representation 
of the selection of alternatives, the examining of possibilities, the setting 
out toward goals, the experiencing of frustration, and the like. It may or 
may not involve locomotion in the real world. If I move toward an object 
of choice in the real world, I am also moving in my life space, but the two 




LEWIN’S FIELD THEORY 


265 


motions are not to be identified; in life space we represent the psycholog- 
ical significance for the person of the motion in physical space. A typical 
life-space diagram is shown in Figure 35. 

Occasionally life space corresponds very closely to the real world 
with which the person is in commerce. In that case he is said to be in 
touch with reality. At the other extreme, we find individuals living in a 
world of fiction and fantasy — out of touch with reality. In order to describe 
these differences, Lewin proposes reality-irreality as a dimension of life 
space. The very young child may not distinguish between his hopes and 
wishes and the actual circumstances of life. Presumably there is a greater 
differentiation between reality and irreality as we grow older, although 
normal adults are still capable of wishful thinking. 

The Person in Life Space 

The person is often represented as a point moving about in his life 
space, affected by the pushes and pulls upon him, circumventing barriers 
in his locomotion in his own life space. If the person is to be treated as a 
particle in a field, this representation as one point in space is a convenient 
one. 

The person is also represented in a different way, as a differentiated 
region in life space, the person having a structure of its own. There are 
more superficial and deeper (less accessible) layers of the personality; 
there are different degrees of complexity of organization and differentia- 
tion; some parts are in communication, while other parts have strong bar- 
riers between them. The structure of the person is sometimes diagrammed 
as in Figure 36. As Leeper 9 points out, a great deal of the experimental 
work of Lewin and his students has dealt with constructs related to the 
person, including studies of interrupted tasks, satiation, anger, and rigid- 
ity. 

Lewin was not very clear about the exact relationship between the 
person and his life space. Deutsch 10 believes that Lewin used the term 
“person” in three ways, though Deutsch thinks Lewin might not have 
agreed with his analysis. One use is as the behaving person in life space, 
essentially a geometrical point upon which the forces diagrammed in life 
space impinge, determining its locomotion. A second use makes the prop- 
erties of the person, in interaction with the environment, produce life 
space. His needs, beliefs, and values, determine how life space will be 

9 Leeper (1943), page 62. A summary of earlier work was called A dynamic theory 
of personality ( Lewin, 1935 ). 

10 Deutsch ( 1954), pages 190-191. 
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structured for him. The third possibility is that the person and life space 
are equivalent, being two ways of representing the same facts. That is, 
the diagram of Figure 35 represents the changes in the psychological en- 
vironment concurrent with the changes in the tension systems shown in 
Figure 36, and these are two aspects of the psychological situation at the 
time. Lewin’s own preference would probably have been to combine the 
first and thir d of these ways of considering the person, that is, he would 
conceive the “person” as the behaving person, but he would have the be- 
having person represent at once a differentiated region in life space. 



FIGURE 36. Representation of the person. The motor and perceptual regions, 
because they have commerce with the environment (E), are represented at the 
boundary zone (M). The portion lying within represents the inner-personal region 
(I), which has more central parts (c) and more peripheral parts (p) Under differ- 
ent circumstances of stress, the inner-personal region undergoes differentiation or 
dedifferentiation ( ** primitivation” ) . By permission from Principles of topological psy- 
chology , by K. Lewin, Copyright 1936 by McGraw-Hill Book Co., Inc. 

Valence 

When a person is attracted by an object that object is said to have 
positive valence , an analogy with the valence between atoms in chemistry. 
The person tends to move toward a region in life space that has positive 
valence, that is, toward an attractive goal or into a region where satisfying 
activity is possible. To use an older vocabulary, as adopted by Hull ( 1952), 
positive valence leads to adient activity. By contrast, negative valence 
refers to an object, or region in life space, from which the person is re- 
pelled. That is, objects and activities that a person avoids have negative 
valence, and they lead to abient activity. 

Because life space may contain regions with several valences active 
at once, these give rise to conflict, especially when the opposing forces 
are approximately in balance. Lewin specified three chief kinds of con- 
flict: u 

1. Two positive valences. A child has to choose, for example, between 
going on a picnic and playing with his comrades. 

11 Lewin ( 1935), pages 88-91. The examples cited are his. 


LEWIN'S FIELD THEORY 267 

2. A simultaneous positive and negative valence. A child is offered 
a reward for a school task he does not wish to perform. 

3. Two negative valences. A child is threatened by punishment if he 
does not do a task assigned to him. 

The experimental analysis of conflicts of these types has proven fruit- 
ful. 12 

The assigning of the valence to the object represented in life space 
does not, of course, imply that the valence actually inheres in the object. 
While something like an inverse-square law applies to valence (“the force 
of attraction varies inversely as the square of the distance”), the theory 
requires that the distance must be measured psychologically, just as the 
object carrying valence is the object as perceived by the person. The same 
object will carry different valences for different people because each of 
them perceives it differently. 

Distance and Direction 

Life space is quasi-physical, so sometimes there is a gross corre- 
spondence between locomotion in life space and in physical space. A 
conflict between two objects of which one is on the person's right and 
one on his left, inviting locomotion in opposite directions, will be repre- 
sented in a life-space diagram by regions and paths in opposite directions. 
When physical space and life space are in close correspondence, physical 
distances and directions may be used for experimental purposes as ap- 
proximations of distances and directions in life space (that is, Euclidean 
space is used then to represent hodological space). 

But life space often does not correspond to physical space. A choice 
that is arrived at through reflective thinking is a locomotion in life space 
quite as genuine as a locomotion involving gross muscular movement. 
In such cases, where there is no physical counterpart of locomotion, the 
specification of distance and direction becomes very difficult. 

Lewin had little to say about distance. If one path includes another 
on the way to a goal, it is evident that the included path is shorter than 
the entire path. 13 That is all that we can say about distance, except in cases 
in which we assume a correspondence between environmental (physical) 
space and life space. 

Lewin gave a fuller treatment to direction. The direction of a path 
is defined by the start of the path, “by the beginning step of the distin- 

12 See the early experiments of Fajans (1933a) (1933b), and the later ones by 
Hovland and Sears ( 1938), Sears and Hovland ( 1941 ), Barker ( 1942) { 1946). 

13 Lewin ( 1938), page 50. 
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guished path/' 14 There are four possible directions of locomotion. The 
letter d ? with appropriate subscripts, serves to symbolize the path whose 
direction is defined: 

(1) The direction of the distinguished path from A to G would be d A)G . To 
indicate a particular route from A to G, the expression might be d AtBfG , the ad- 
ditional intermediate letter or letters indicating distinctive portions of the path. 

(2) Sometimes the person tends to stay in the present region (continue his 
present activity). The direction of this tendency is d A>A . 

(3) Conversely, the direction of a tendency to leave the present region in 
so far as this tendency involves* a disregard of what other regions are moved 
toward, or the direction of a tendency not to enter the region A, would be d At _ A . 

(4) Finally, the tendency sometimes is to increase the distance from a not- 
present region. This tendency has the direction d Aj _ G . 15 

Lewin showed the applicability of his method of representation to a 
variety of experiments often studied in the learning laboratory: maze 
learning (familiar and unfamiliar mazes, mazes with one-way doors, 
T-mazes), the roundabout-route experiments, the use of tools, the obstruc- 
tion box, extinction of lever-pressing, goal-gradient behavior, various 
conflict problems. Whether or not his theory is a "learning theory” its 
relevance to learning' is established by these applications. 

An Example of the Representation of Experimental Results 

Hull (1933) and Leeper (1935a) performed similar experiments in 
which they taught rats to turn one way through a maze when hungry, 
another way when thirsty. A rather slight difference in physical arrange- 
ments made an enormous difference in the ease of acquiring the habit. 
Hulls rats took 25 periods of 8 days each ( a total of 1000 trials at 5 trials 
per day) before achieving about 80 per cent correct choices. Leeper s 
rats performed as well after a single 8-day period (a total of 40 trials). 
Hull's animals ran to food or water in a common goal-box, finding the 
door closed if approached from the wrong side. When hungry they could 
enter the goal-box and find food only if they ran to the left; when thirsty 
they could find the door open and water in the goal-box only if they ran 
to the right. (I am disregarding the appropriate balancing out of right 
and left in the careful design of the experiment. ) Leeper s rats ran in- 
stead to separate end boxes on the food days and the water days. Both 
boxes were open, so that if they ran the wrong way on a water day 
they found food in the end-bqx they entered; conversely, if they took 
the wrong path on a food day, they found water in the box entered. 

14 JLewin ( 1938 ) , page 27. 

15 The summaryTs quoted from Leeper (1943), page 125. 
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Let us now see how Lewin diagrammed this difference. 16 

Lewin chose to represent the two experiments by diagrams spatially 
similar to the actual experimental arrangements, slightly stylized by con- 
verting mazes with right-angle turns into circular ones (Figures 37 and 
38). There is great freedom in the construction of topological or hodo- 
logical representations. Hulls problem has to be represented, however, as 


Of Gw Of Gw 



FIG. 37 FIG. 38 


FIGURE 37. Representation of the situation in Hull’s experiment (Hull, 1983). 
A, entrance box; B, goal box; Gf, food goal, reached on hunger day by path A, 
E,B; Gw, water goal, reached on thirst day by path A,D,B; /v, t . , force generated by 
\alence of food; force generated by valence of water. Entrance from the in- 
correct side was blocked off by a closed door at the goal-box. ( Modified slightly from 
Lewin, 1938, page 182, ) 

FIGURE 38. Representation of the situation in Leeper’s experiment ( Leeper, 
1935a). A, entrance box; K, goal box m which food was also present, whether the 
animal was hungry or thirsty; L, goal box in which water was always present, whether 
the animal was hungry or thirsty; Gf, food goal, reached by path A,E,K; Gw, water 
goal, reached by path A,D,L; /a, force generated by valence of food; /a,g w , force 
generated by valence of water. Entrances to both goal-boxes always open. (Modified 
slightly from Lewin, 1938, page 182.) 

two paths to a common goal (Figure 37), while Leeper’s must be repre- 
sented as two paths to different goals ( Figure 38) . Leeper s problem could 
have been diagrammed as a T-maze or a Y-maze without doing violence 
to its hodological representation. The important difference between the 
two figures lies in the forces set up by the highly valenced goals. 

16 Lewin (1938), pages 179-184. It is instructive to compare his explanation with 
that by Leeper (1935a), pages 31-32, and that by Hull (1952a), page 140, Both 
Leeper and Hull attribute some influence to the “latent learning” resulting from seeing 
food when not hungry and water when not thirsty. In his more formal treatment, Hun 
wrote as though the Hull-Leeper results were alike, thus ignoring the 25: 1 difference in 
difficulty between the two experimental arrangements. 
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Of the differences between the experiments, Lewin writes: 

For determining the direction of the forces / Ai g f /a,gw point A in 
the experiment of Hull, there remains the question of determining the relation 
between the directions d A(D and d A>E . In Leeper’ s experiment both are clearly 
opposed. In Hulls experiment we have to deal with a rather complicated rela- 
tion between the directions of two paths from the same starting region to the 
same end-region. By discussing the geometry of the situation we have seen that 
those directions have to be considered ‘partly equal.” That would mean in our 
case: 

d A ,D ( = ) d A> E 

which involves both a certain amount of equality and difference of these direc- 
tions. We expected that the difference of both directions should become more im- 
portant the greater is the potency of the means, relatively to that of the ends. 
Indeed, I would assume that one of the reasons for the difficulty in Hull’s experi- 
ments is that the regions E and D are relatively slightly emphasized means to a 
common end. 

At any rate, the directions d AtD and d AfE are not clearly opposed to each 
other, as in Leeper s experiments. In addition, the force f A> Gf in Hull’s experiment 
points in two directions, d A D and d AtE . Also the force / A(Gw points in these two 
directions. In Leeper’s experiment each of these forces points only in one of 
these directions. 17 

Apart from pointing out the obvious differences in experimental ar- 
rangements, Lewin proposes two explanations for the greater ease of learn- 
ing by Leeper’s rats than by Hull’s : ( 1 ) the two paths are less differenti- 
ated for Hull’s rats, and ( 2 ) the forces determining the choice at the 
starting point are more nearly in balance for Hull’s rats. Unfortunately, 
Lewin’s explanation fails to tell us how he would propose to handle a 
number of features of the experiments: 

How does experienced goal satisfaction or frustration affect the cognitive 
structure on the next trial? 

How does the state of hunger or thirst modify the cognitive structure of the 
rat, or the forces or valences operating? 

Of what importance are the closed doors in Hull’s experiment? 18 

If the forces r.c trig on Hull’s rats are in balance, why do the rats eventually 
solve the pr il>\ mr 

Lewin succeeded in making it plausible that there should be greater 
difficulty for Hull’s rats than for Leeper’s, but he clearly did not account 
in any detail for the findings. One can only conclude that, if other ex- 

17 Lewin (1938), page 183. 

18 Lewin noted their importance, and gave an illustration from human behavior to 
make the difference between Hull’s experiment and Leeper’s seem less paradoxical, 
Lewin ( 1938), page 181. But the closed doors do not appear in the formal treatment of 
the experiment, 
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planations are as incomplete as this one, distance and direction are left in 
an unsatisfactory state. 19 

Behavior as a Function of the Present Life Space 

Perhaps even more forcefully than his gestalt colleagues, Lewin in- 
sisted that behavior depends upon the present, not upon the past or the 
future. Past events, like future events, do not exist now, and therefore in 
his sense, cannot have effects now. While past psychological fields are part 
of the origin of the present field, their relationship to the present is so 
indirect that their explanatory value is slight. 

As pointed out earlier, the notion of contemporary causation is a 
sound one, and need not lessen the importance of a knowledge of the past 
in contributing to an understanding of the present. It is true, however, 
that in adopting a logical position there often go along with it certain 
overtones of interest and preoccupation not essential to the position. 
Lewin’s whole-hearted insistence on the importance of the present ac- 
centuated his interest in conflict and other contemporaneous motivational 
situations, and lessened his interest in progressive changes, such as those 
studied in conventional learning situations or in the psychological clinic. 

Lewin justifies his neglect of learning on different grounds. He says 
that learning (i.e., the influence of the past on the present) has been much 
overemphasized as a reaction against earlier prescientific teleology (i.e., 
the influence of the future on the present). Field theory’s emphasis on 
the present he believes to be a necessary corrective, establishing a pre- 
ferred balance of interest. 20 

The Time Dimension 

Despite his preference for dealing with present causes, Lewin was 
not, of course, oblivious to the time dimension. Because learning theorists 
tend to be preoccupied with changes that take place in time, it is of in- 
terest to examine Lewin’ s position. 

In a discussion defending the principle of contemporaneous causa- 
tion, at the same time showing that he was aware of historical factors, 
Lewin had this to say: 

Without altering the principle of contemporaneity as one of the basic 
propositions of field theory, we have to realize that to determine the psychological 
direction and velocity of behavior (i.e., what is usually called the meaning” of 
the psychological event), we have to take into account in psychology as in 

ld This conclusion was reached also by Leeper (1943), who then suggested im- 
provements that would require major changes in Lewin s system. 

Lewin (1942), page 222. 
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physics a certain time-period. The length of this period depends in psychology 
upon the scope of the situation. As a rule, the more macroscopic the situation is 
which has to be described the longer is the period which has to be observed to 
determine the direction and velocity of behavior at a given time . 21 

Lewin accepted from L. K. Frank (1939) the dimension of life space 
called “time-perspective,” which includes the psychological past and the 
psychological future. The small child lives largely in the present, and as 



FIGURE 39. The time dimension in life space. The sequence of events starts at 
the top and moves downward. Note the movement of the person ( x ) along path p s 
in a small segment of time, and along path p s in a larger segment of time. Lewin 
(1943), page 302. 


he grows he conforms his behavior to events in the more remote future. 
Part of the overemotional reactions of adolescents may be due to their 
attempt to face adult problems before they have sufficient time-perspective. 
Then they interpret a momentary frustration as an enduring defeat. 

21 Lewin (1943), page 301. 



273 


LEWIN’S FIELD THEORY 

No very satisfactory diagrammatic device was found for adding the 
time dimension to life space. One kind of diagram resembles those used 
to show the pathways through the spinal cord. Then the time dimension 
can be thought of as the long axis of the cord, the cross sections corres- 
ponding to momentary life space. Such a diagram is given in Figure 39. 

Another diagram presents life space captured at various segments 
of time, as though life space moves along on a continuous belt and can be 
examined at any moment. Such a diagram, for life space in relation to a 
planned future event, is given in Figure 40. In this diagram there appears 
also a vertical dimension, indicating the difference between life space on 



FIGURE 40. Time perspective in planning. The sequence of events .starts at the 
left and moves to the right. Each segment is as seen by the person at the time repre- 
sented. The vertical dimension is that of irreality (wish; fantasy). The coming to- 
gether of the two levels m the future means that there is less diffeienee between 
reality and irreality as something planned is more nearly achieved. Note that at earlier 
stages the wish makes the person appear to be nearer to the goal in irreality than in 
reality. From Barker, Dembo, and Lewin (1941), page 210. 

the plane of irreality (fantasy), and on the level of reality. Note that the 
future, as now perceived, appears in present life space, but life space 
itself changes as time moves on. 

APPLICATIONS TO LEARNING 

Learning as a Change in Cognitive Structure 

As one learns, one increases in knowledge. What does it mean to 
know more? It means to have a more highly differentiated life space, in 
which there are more subregions connected by defined paths. That is an- 
other way of saying that we know facts in their relationships; we know 
what leads to what. 

A problematical situation represents an unstructured region of life 
space. We do not know how to get from the givens to the goal. We feel 
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insecure until the region becomes structured. When it does become struc- 
tured so as to permit problem solution, we have learned. 

A change in the structure of knowledge (cognitive structure) may 
occur with repetition. The situation may require repeated exposures before 
the structure gets changed. The important thing is that the cognitive 
structure gets changed, not that the repetitions occur. With better arrange- 
ments of the problem the structure may get changed with fewer repetitions. 
This is the lesson of the insight experiments. Too much repetition does 
not aid learning; on the contrary, repetition may lead to psychological 
satiation 22 with accompanying disorganization and dedifferentiation of 
the cognitive structure. 

Changes in cognitive structure come about, in part, according to the 
principles of patterning in perception, as previously discussed. These 
changes are due to "forces” intrinsic to the cognitive structure. But 
cognitive structure is also changed according to the needs of the individual. 
A psychological force corresponding to a need can have either of two 
consequences. It can lead to "locomotion” in the direction of the force. 
This means, in common language, that the need is satisfied in familiar 
ways; new learning is not required, and the cognitive structure may remain 
intact. Or the force can lead to a change in cognitive structure so that 
such a "locomotion” may be facilitated. That is, the relationships within 
the situation are seen in new ways, so that the need may be satisfied. In 
the latter case, motivated learning may be said to have taken place. 

Reward and Punishment 

Those who accept law of effect or reinforcement theories have seldom 
analyzed in detail what the circumstances are which force the learner to 
confront the reward or the threat of punishment. Lewin’s treatment of 
the conflicts involved is recognized by at least one of his critics as an 
advance over traditional law-of-effect thinking. 23 

The usual arrangement in punishment is illustrated in Figure 41. 
The threat of punishment (Pu) is used to keep the learner at an intrin- 
sically disliked task (T). A conflict is set up so that the individual is forced 
to choose between one or the other of the disagreeable possibilities. Under 
these circumstances, the tendency is to "leave the field,” to avoid both 
of the tasks. In order to keep the learner in the field of the conflict it is 
necessary to erect barriers (B). These in actual life are usually authoritar- 
ian; it is necessary to "police” a learning situation controlled by punish- 

2 - Karsten ( 1928). 

23 Estes { 1954b), page 340. 
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ment Because association psychologists have tended to perform their 
experiments in confining runways or with subjects strapped to the ap- 
paratus, they have usually overlooked important dynamic arrangements 
of their experiments. The importance of the barrier is not incorporated 
in their theories. 
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FIGURE 41. Life space in threatened punishment. The person is caught between 
the tendency to avoid an unpleasant task (T) and the threat of punishment (Pu). 
Because of the conflict there will be a tendency to leave the situation (1) unless 
there are strong barriers to prevent this. Hence the punishment situation to be effec- 
tive, is necessarily prison-like. After Lewm (1942), page 233, by permission of the 
National Society for the Study of Education. 

In a reward situation it is not necessary for the learner to be “walled 
in” because the attractiveness of the reward keeps him in the field. It is 
necessary, however, to keep a barrier around the reward, to prevent access 
by any route (such as W) other than by going through (performing) 
the disliked but requested activity (Figure 42). Because the reward is 
externally related to activity T, there will always be a tendency to take 
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FIGURE 42. Life space in offered reward. The person is in conflict because the 
approach to reward (R) is by way of the unpleasant task (T). While the tendency 
to leave the field is not present as it is in punishment, there is a tendency to approach 
the reward by way of a shortcut so as to avoid T, as by path W*,»,k. Hence it is 
necessary to guard access to the reward to prevent approach by any path other than 
that through T. Some policing is necessary, but the situation is less prison-like than 
in the case of control by punishment. After Lewm (1946), page 811, reproduced 
by permission from Manual of child psychology edited by L. Carmichael, published 
by John Wiley & Sons, Inc. 
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short cuts, if possible. Cheating by school children is such an effort to 
obtain the reward without performing the demanded work. 

The differences in structure between reward and punishment situa- 
tions are real. One consequence of the different amounts of constraint in 
the two situations is that rewarded activities often become interesting and 
liked, so that motivation is no longer extrinsic, whereas activities controlled 
by the threat of punishment tend to become increasingly hated. 24 

Success and Failure 

When goals are intrinsic, it is more appropriate to think of the goal 
activity as successful or unsuccessful rather than as rewarding or punishing. 

The contrast between a “psychological” approach to psychological 
problems and a “non-psychological” approach is nowhere clearer than 
in the difference in treatment of reward and success. A reward is some- 
thing tangible and external, which merely terminates a situation (Guthrie) 
or is associated with need reduction (Hull). A success experience has to 
be understood according to what the learner is trying to do, and the 
relationship between success and goal-achievement is a somewhat com- 
plex affair. Psychological analysis of success from the point of view of the 
learner shows at least the following possibilities: 25 

1. To reach a goal constitutes success. This is the usual interpretation 
of success. It is obviously satisfying to try for something and to achieve it. 

2. To get within the region of the goal may be a success experience. 
Some goals are less well defined than others. A student may hope for an 
A grade, but be satisfied with a R; a person may wish an office, but take 
satisfaction in the fact that his fellow-members nominated him. The 
curious practice in India of signing a letter with the statement following 
the signature “B.A., failed,” testifies to the fact that getting near enough 
to the goal to take the examination is a mark of esteem. 

3. To make noticeable progress toward a goal may provide a success 
experience, even though the goal is remote. To pass freshman physics 
is not to be admitted to medical school, but it is a step along the way. 

4. To select a socially approved goal may in itself be a success ex- 
perience. To be the kind of student who carries books home over the 
Christmas holidays is a mark of serious intent and is bolstering to the ego, 
even though the books remain unopened. To own an encyclopedia is to 

24 Lewin (1942), page 233. For a fuller discussion of promised reward and threat- 
ened punishment, see Lewin (1935), pages 114-170. 

- >8 Hilgard { 1942 ) . 
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be the kind of person you would like to be, whether or not the encyclo- 
pedia is consulted. 

If such matters are talked about in the language of primary and 
secondaiy reinforcements, some of the vividness of the learner-goal re- 
lationship is bound to be lost. It is to Lewin's credit that he made a serious 
effort to deal with psychologically real problems, and then, as a second 
step, tried to find a mathematics appropriate to them. Too often the al- 
ternative is chosen of looking first for something which can be measured 
and counted, hoping that psychologically more interesting relationships 
can be deduced from them later on. This leads to preoccupation with 
nonsense syllables, mazes, and conditioned responses, to the neglect of the 
problems of behavior in freer life situations in which the learner selects 
his own goals and redefines them as he goes along. 

The experiments which pointed up the concepts used by Lewin and 
his students in discussing success and failure were those of Hoppe (1930) 
which introduced the companion notions of ego involvement and level of 
aspiration . 

Psychological success and failure depend upon ego involvement in 
the task at hand. That is the goals must be real to the learner, so that, if 
achieved, there is the elation of significant accomplishment; if not achieved, 
there is a chagrin or humiliation of defeat. As William James put it, our 
self-esteem may be expressed as the ratio between our success and our 
pretensions. 20 

Hoppe pointed out that there is a restricted region of difficulty within 
which success and failure are possible (Figure 43). Some tasks are much 
too easy. To succeed in them is not to experience success. A school child 
is insulted if asked to spell words which are too easy or to make computa- 
tions interpreted as '‘baby stuff.” Some tasks are much too hard. If we 
are asked to give a construction in Russian grammar — and we make no 
pretense of knowing Russian — this is not a psychological failure. It is 
only within the range of uncertainty — where both success and failure 
are possible — that we can really succeed or really fail. Experienced teachers 
know how hard it is to keep tasks at an appropriate level of difficulty so 
that the learners remain ego-involved. 

Individuals tend to set momentary goals within the range of activities 
in which there is ego involvement. The momentary goal is referred to as 
the level of aspiration . It is set according to the learner's interpretation 
of his own achievement, but there are wide individual differences in 

26 James ( 1890), page 1, 310. 



278 THEORIES OF LEARNING 

the manner of such goal-setting. Some learners are realistic, and set their 
goals near to what they have shown themselves capable of doing. Those 
who are unrealistic may depart in either direction from their past achieve- 
ments. Some are self-protective, and to avoid the possibility of failure set 
goals too low; others are hopeful, and set goals unrealistically too high. 27 
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FIGURE 43. Difficulty-level and the experience of success and failure. The psycho- 
logical experience of success and failure can occur only in the region in which there 
is some possibility of objective achievement or lack of achievement. By permission 
from A dynamic theory of personality, by K. Lewin, Copyright 1935 by McGraw- 
Hill Book Co., Inc. 


The interrelationships among past achievements, momentary goals, 
social atmosphere, and individuality, set a number of inviting experimental 
problems. 28 The flavor of these studies is legitimately compared with that 
of studies on reward, punishment, and reinforcement, which represent the 
approach to the same subject matter taken by the stimulus-response psy- 
chologists. Regardless of how one may view the conceptual system within 
which the level-of-aspiration studies are framed, it is evident that they 
call attention to aspects of goal-directed activities which have been neg- 
lected by those preoccupied with reward and punishment. 

27 P. S. Sears (1940) (1941). 

28 For reviews of experiments concerned with level of aspiration, see J. D. Frank 
(1941) and Lewin, Dembo, Festinger, and P, S. Sears (1944). 
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Learning as a Change in Motivation 

The repetition of an activity brings changes both in the cognitive 
structure and in the need-tension systems. Often a desirable outcome is 
a change in the interests or values of the learner, that is, a change in the 
relative attractiveness of one goal over another. This goal-attractiveness is 
what Lewin calls valence , and valences change. 

The following three cases of valence-change as a result of repetition 
are to be distinguished: 

1. Attractive goals may lose their attraction if the activity related 
to them is repeated to the point of satiation. Monotony and boredom are 
familiar enough experiences, but their dynamics are by no means simple. 
The suggestion that every activity presents a barrier to its own repetition 
(one of Hull's postulates) does not suffice to account for the details that 
are found. For example, when tasks are graded as pleasant, unpleasant, or 
indifferent and then repeated excessively, the unpleasant tasks, as might 
be expected, are first satiated. 29 But it is the pleasant tasks which next 
become impossible to repeat, and the indifferent tasks are consistently 
last. Because of less ego involvement in the indifferent tasks, their endless 
repetition does not develop the resistances found for either pleasant or un- 
pleasant activities. Psychological satiation is not to be confused with 
fatigue, even though fatigue symptoms are prominently present. Karsten 
showed that the symptoms often disappeared with a change of set, even 
though the same muscular activities might be involved in the new task. 
For example, a subject who had to quit a line-drawing task because of 
inability to hold the pencil any longer was asked to write out a description 
of the psychological processes which had been involved in the perform- 
ance, and was able to pick up the pencil and write about the performance 
with ease, including demonstrations of the task which no longer could 
be performed! 

2. Goals originally unattractive may become acceptable through a 
change in the meaning of the goal-related activity. There are many teach- 
ing techniques which depend upon this principle. Lewin gives a number 
of examples. 30 One is the changing of food preferences by telling a story 
in which the disliked food is the favorite for the hero. 81 This is a common 
radio device for selling breakfast food. Another illustration is the changing 
of ideologies, as in the experiment of Bavelas (1942) in which relatively 

29 Karsten (1928). 

80 Lewin (1942), pages 235-238. 

81 Duncker (1938). 
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autocratic recreation leaders were changed into democratic ones. It was 
necessary for the cognitive structure of the field, “leadership behavior,” to 
undergo changes so that what they were trying to do as leaders became 
changed. 

3. The choice of goals is influenced by previous experiences of suc- 
cess and failure. This is most clearly shown in the experiments of Sears 32 
in which she found that realistic goal-setting was characteristic of those 
who had experienced success. It is possible to attenuate the bad con- 
sequences of failure by teaching more appropriate adjustments to it, thus 
raising children who can tolerate frustration. 88 Rut this does not alter the 
fact that in addition to the problem of how goals affect behavior there 
exists the very important problem of how goals are chosen. 

MEMORY FOR FINISHED AND UNFINISHED TASKS 
AS ILLUSTRATIVE OF EXPERIMENTS SUGGESTED 
BY LEWIN’S THEORY 

The facts of amnesia show that forgetting and retention are incom- 
pletely understood according to our ordinary theories of forgetting through 
disuse or retroactive inhibition. That there are motivational and emo- 
tional factors at work has been emphasized in the Freudian theory of 
repression. 34 It is in this context that Zeigarnik’s (1927) experiment on 
the recall of finished and unfinished tasks is to be considered, for the 
experiment dramatically relates forgetting to tension systems within the 
individual as a demonstration of Lewin’s (1926) theory. 

The subjects (variously students, teachers, children, 256 in all) were 
given simple tasks to perform. In the individual experiments there were 
usually 18 to 22 such tasks: modeling a figure from clay, laying out playing 
blocks in certain forms, solving simple puzzles, and so on. The experi- 
menter interrupted the subject on half the tasks, waiting until the subject 
was deeply engrossed in it so that it would be a psychologically real in- 
terruption. Which tasks were interrupted differed from subject to subject 
according to conventional experimental practice. 

After “tidying up” in order to remove all tools and other reminders 
of specific tasks, the experimenter asked the subject: “Please tell me 
what the tasks were upon which you worked during this experiment.” 
Often a number of tasks were mentioned, then a pause would occur. After 

32 P.S. Sears (1940) (1941). 

33 Keister (1937). 

34 E.g., B. R. Sears (1936a), Rapaport (1942) (1950). 
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considerable effort a few more tasks might be recalled. For purposes of 
scoring, only those tasks were counted which were readily recalled, that 
is, before the pause. The difficulty of such recall is greater than might be 
supposed. In one series, 32 subjects recalled between 7 and 19 tasks out 
of 22 worked on, averaging 11, or just half the total; in another series, 
14 subjects recalled between 7 and 16 of 20 tasks, averaging 10, again half 
the total. 

When the relative success of recall for completed or interrupted tasks 
was examined, it was found that of the first 32 subjects, 26 remembered 
interrupted tasks better than completed ones, 3 remembered both equally 
well, and 3 remembered the completed tasks better. 

A scoring ratio was devised to indicate the relative superiority of in- 
terrupted tasks recalled (I) over completed tasks recalled (C). If both 
were equally recalled, the ratio would be I/C = LOO. In the first experi- 
ment the ratio turned out to be 1.9, indicating that the interrupted tasks 
were recalled 90 per cent better than the completed ones. 35 Most of the 
other ratios obtained from groups were in the same direction, that is, 
above 1.0, although there were wide individual differences. 

Zeigarnik went on to test several hypotheses to account for the greater 
retention of the unfinished tasks, and also to determine the circumstances 
under which finished tasks would be better recalled. 

The first hypothesis tested was that the attention value of the task 
might be enhanced by the shock of interruption. When, however, the task 
was interrupted early in the hour and completed later, it was forgotten 
like any other completed task, thus contradicting the hypothesis. 

A second hypothesis was that the subjects expected the interrupted 
tasks to be resumed and therefore made special note of them. However, a 
test situation in which one group was told that the tasks would be re- 
sumed, and another that they would not, failed to result in a difference 
between the groups. 

The remaining hypothesis, the one accepted, is that a task not com- 
pleted leaves a state of tension, a quasi-need. Completing the task means 
resolving the tension or discharging the quasi-need. The advantage in 
memory of the unfinished task is due to the continuation of the tension. 

The advantage of unfinished tasks is not to be universally expected. 
Zeigarnik pointed out at least eight conditions under which the relative 
advantage is lost: 

1. If a task, finished to the experimenter's satisfaction, is considered 

33 Marrow (1938) has pointed out that the ratio of 1.9 is too high because it is an 
average of ratios. Corrected as a ratio of tasks recalled it is reduced to 1.6. 
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inadequately completed by the subject, it may be recalled as well as an 
interrupted task. 

2. Tasks which are without a clear-cut completion stage, such as 
marking Xs on a sheet of paper, have no advantage in memory following 
interruption. Interruption is in such cases not a psychological incomplete- 
ness; it is the mere calling of a halt to an endless activity. 

3. Subjects who do not become at all involved in the individual tasks, 
but act according to command, remember finished and unfinished tasks 
equally well. A group of visiting school children who were set to work 
when they had come merely to see the laboratory were obedient in this 
way, and their recall ratio was I/C = 1.03. 

4. If the tasks are seen as leading to the satisfaction of a single need, 
so that the boundaries between them are weakened, the individual effects 
of interruption are lost. One small group of subjects became engrossed 
in discovering the meaning of the experiment, so that the individual tasks 
were imbedded in this search for the secret of the experiment. The ratio 
was I/C = 1.12. Another group of subjects was shown in advance all 
the tasks that were to be solved in the hour. By reducing the separateness 
of tasks the difference between interruption and completion was lost, with 
I/C = 0.97. 

5. If subjects were very tired when performing the experiment and 
rested when tested, or rested while performing and tired when tested, 
the memory advantage of unfinished tasks was lost; in fact, for the tired- 
fresh group, the finished tasks were better recalled (I/C = 0.61). An in- 
genious explanation is offered. It is said that the tension-system cannot 
be maintained in a loosened, relatively fluid medium, hence memory for 
the uncompleted tasks is poor in a tired condition. On the other hand, 
under such circumstances relatively stable systems (finished tasks) per- 
sist in spite of the state of fatigue. 36 

6. After delays of 24 hours, the I/C ratio drops to about 1.00, so that 
there is no longer an advantage for the interrupted task. This is interpreted 
as due to intervening activity, not to time alone, because of the next ob- 
servation. 

7. If a highly emotional episode is introduced between the tasks and 
the retest, the same distortion occurs as with the passage of time. Six 
subjects tested in this way all showed better recall of finished tasks. 

36 The tests were made after 13-15 hours, when the relative advantage of the inter- 
rupted task might have been lost in any case. If the conjecture holds, immediate reten- 
tion after performing while tired should favor the completed tasks. 
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8. A task which the subject found too difficult was often forgotten, 
even though it had been interrupted. The situation is considered to favor 
repression. 37 

The practical implications of Mrs. Zeigarnik’s findings are somewhat 
obscure. In the first place, the memory for the tasks performed is a very 
different matter from memory for what was learned or how to perform 
the task next time. It would be a mistake to assume that children should 
not be permitted to complete assignments, lest they forget them. They 
might forget what they did in the library on Tuesday afternoon and still 
remember what they found out there. 38 In the second place, the phe- 
nomenon is so temporary, and so easily disturbed by other experiences, 
that its practical meaning is uncertain. It may be that the temporary nature 
of the effects is due to the triviality of the tasks; if that is so, it is important 
to have a comparable demonstration made with more significant tasks. 

One of the more revealing of the later studies concerned with the 
problem of recall of the task is that of Marrow (1938). He explained that 
he was trying out different sorts of materials, and that if he saw the sub- 
ject was getting along all right he would stop him and give him a new 
task. Only if the subject was taking unusually long, or having trouble, 
would he be permitted to finish the task. Under these circumstances, inter- 
ruption was psychologically interpreted as success, non-interruption as 
failure. Recall was found to be higher for the completed tasks, that is, 
those which caused feelings of failure. While wholly in agreement with 
the position that ego-related aspects of the performance are important in 
the recall, the tension set up is not quite that of task completion, although 
it may be related to it in the form: “If only I had another chance at it!” 

The type of investigation started by Zeigamik has proved a stimulus 
to a great deal of future work. 39 Some of the subsequent studies have been 
concerned with a slightly different problem — the resumption of the in- 
terrupted tasks. It is evident from these studies that correlated with the 
memory for the task is a tendency to resume it when the opportunity next 

37 For a list of 11 variables affecting the results in experiments of this kind, see 
Prentice ( 1944), page 338. 

38 McKinney ( 1935), however, found a tendency for the substance of an interrupted 
task (maze learning) to be better retained than "the substance of a completed task. 
Nuttin (1953), pages 341-351, also has shown memory for the substance of the un- 
finished tasks rather than retention for the names of the tasks. 

39 For useful summaries see Prentice ( 1944), Glixman ( 1948), McGeoch and Irion 
(1952), pages 390-394, Alper (1952). Representative studies include Abel (1938), 
Alper (1946) (1948), Atkinson (1953), Glixman (1949), Martin (1940), Paehauri 
(1935), Rosenzweig (1933) (1941) (1943), Rosenzweig and Mason (1934), San- 
ford ( 1946). 
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arises. 40 These studies are consonant with the Lewin interpretation that 
a quasi-need has been set up as a result of the interruption. 

ESTIMATE OF LEWIN’S POSITION 

Lewin on the Typical Problems of Learning 

Because his was not essentially a psychology of learning, it is difficult 
to assign Lewin a position on each of the standard problems. A large 
measure of conjecture necessarily enters. 

1. Capacity . The life space of an adult is more highly differentiated 
than that of a child. Similarly, the life space of an intelligent person is 
structured more highly than that of a less intelligent person. There are 
also differences in fluidity or rigidity. 41 

2. Practice . Learning may take place with repetition because the 
change in cognitive structure or in motivation may require repetition. 
There is, however, no one-to-one relationship between number of trials 
and the changes which constitute learning. 

3. Motivation. The differences between reward and punishment are 
well represented in life-space diagrams, which point out the difference in 
the amount of policing required in the two situations. This aspect of re- 
ward and punishment has been neglected by all the other writers con- 
sidered. For some purposes it is preferable to speak about success and 
failure rather than about reward and punishment. Then the concepts of 
ego-involvement and level of aspiration become important. Cognitive 
structure is both activated by and changed by aroused needs or tensions. 
Motivation is therefore of central importance within the theory. 

4. Understanding. Because one of the chief characterizations of learn- 
ing is change in cognitive structure, knowledge and understanding lie 
at the heart of learning. 

5. Transfer. Probably the gestalt concept of transposition is acceptable 
to Lewin. 

6. Forgetting. Motivated forgetting is important. The dynamics are 
rather complicated, with interruption in an ego-involved task leading 
usually to better retention and to attempts at resumption when the oppor- 
tunity is offered. However, something like repression is accepted, in the 
forgetting of too difficult tasks. 

40 Related studies include Adler and Kounin ( 1939 ) , Cartwright ( 1942 ) , Nowlis 
(1941), Ovsiankina (1928), Rethlingshafer (1941), Rickers-Ovsiankina (1937). 

41 Kounin (1941). 
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The Emphasis upon Motivation 

The relationship of motivation to learning has been increasingly 
recognized by stimulus-response theorists, but their usual solutions of 
the problem have not been well adapted to deal with the complexities of 
goal-setting, conflict, and problem-solving. The two common solutions 
have been that of the law of effect (and related reinforcement theories), 
and the addition of set and determining tendencies to ordinary associative 
processes. Lewin has made a real contribution by his more dynamic, in 
some real sense more ‘‘psychological,” approach to the problem. 

Historically there were others making similar proposals. Lewin was 
undoubtedly influenced by Freud, whose work he thought brilliant but 
methodologically unsound. 42 In America, McDougall 42 was the arch op- 
ponent of behaviorism until gestalt psychology came upon the scene. 
McDougall often inveighed against the errors of hedonism (in which he 
classed the law of effect) in placing too much emphasis on what happened 
in the presence of the goal, and too little on the satisfaction which derived 
from the striving. Woodworth, whose Dynamic psychology (1918) gave 
currency to the notion of drives, included the conception of activity itself 
becoming a drive. This is an anticipation of Lewin’s tension theory, al- 
though Lewin probably was not influenced by it. While McDougall and 
Woodworth were not gestalt psychologists, they saw some of the limita- 
tions in contemporary learning theory which Lewin also saw and at- 
tempted to meet through his experiments and theories. 

A satisfactory psychological theory of motivation is almost certain to 
include some kind of ego reference, that is, some incorporation of the 
present activity within the larger goals, values, or ambitions of the 
learner. 44 Lewin’s motivational theories definitely take account of such 
problems. 

Other Aspects of Lewin s System 

The problems of learning can no longer be neatly segregated from the 
other problems of psychology and limited to the kinds of data that can be 
plotted in the form of learning and retention curves. Hence it becomes 
increasingly difficult to determine what aspects of a psychological system 
may become relevant for learning. In the foregoing account of Lewins 
work, emphasis has been placed upon those features that overlap, at least 

42 Lewin ( 1937). 

« E.g., McDougall (1923). 

44 The problem was discussed by the contributors to a symposium on the ego and 
the law of effect, Allport ( 1946), Mowrer ( 1946), and Rice ( 1946). 
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in topic, with what other learning theorists discuss. Here brief notice will 
be taken of certain other features of his system, more remote from the usual 
topics dealt with in the psychology of learning. 

One set of Lewin’s interests might be designated as the dynamics of 
child behavior. This led to the inclusion of chapters by him in successive 
editions of handbooks of child psychology 45 and the collection of a book 
of readings on child development by authors much influenced by him. 46 
With prominence given to childhood as a period of learning, interest in 
child development becomes increasingly important for learning psycholo- 
gists. 

A second set of interests was social psychology, especially group dy- 
namics. An early disciple wrote a social psychology using many topological 
diagrams. 47 A number of Lewin s own papers were collected in a post- 
humous volume (1951). Again, as the influence of the group upon the 
learning of the individual comes more into focus, studies of this kind may 
become more significant for learning. 

Related to the interest in group dynamics was a methodological in- 
terest in action research , a technique for getting things done, while at the 
same time criticizing the process along the way. His papers in this field 
have also been collected and published in a small volume with the title 
Resolving social conflicts (1948). The applied psychology of learning can 
profit from the adaptation of some of the principles of action research, for 
example, in encouraging teachers to use newer methods and in gaining 
the support of the community for these methods. 48 

Is Field Theory in Learning Related to Physical 

Field Theory? 

Critics of Lewin have objected to a certain gain for the prestige of his 
theory because he used words such as “dynamic” and “field,” which bring 
with them implications that the theory is unusually up-to-date and modern, 
and that other theories are old-fashioned and stodgy. 

There are two major criticisms of his identifying his theory as a field 
theory: 

1. If he means simply that he recognizes multiple causation, then his 
theory is no more a field theory than anyone else s. Any sophisticated 
theory recognizes that behavior is a function at once of a number of 
variables. 

45 Lewin ( 1933a ) (1946). 

46 Barker, Kounin, and Wright ( 1943), 

47 J. F, Brown (1936). 
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2. If he means that his theory is a field theory similar to a physical 
field theory, then his mathematics ought to correspond more nearly to 
that used by physicists. 

Lewin answered the first criticism by saying that “the recognition of a 
multitude of factors as determining an event, and even their representa- 
tion as a phase space, does not presuppose field theory .” 49 He points out 
that Thurstones factor analysis fits the notion of multiple causation but 
is not a field theory. 

Lewin believed that his theory did correspond closely to field theory 
in physics. As it applies to psychology, he stated: 

One of the basic statements of psychological field theory can be formulated 
as follows: Any behavior or any other change in a psychological field depends 
only upon the psychological field at that time.* 0 

He believed that the equivalent of the differential in physics ( dx/dt ) 
was his concept of “behavior,” which, for him, covered any change in the 
psychological field. But this is an inexact statement, for his equations do 
not actually make use of differentials. Hence the precise relationship be- 
tween psychological fields and physical fields is indeterminate. As Estes 
has put it: 

The main conclusion from our analysis of Lewin s methodology is not that 
field theory has been found unsuitable in the treatment of learning but rather 
that the issue is still open because Lewin did not construct a field theory. 51 

His mathematical representations have been criticized, apart from 
their relationship to field physics. Lewin’s use of vectors and valences 
leading to locomotion in life space represents a courageous attempt to 
find a mathematics with a natural ‘‘fit” to the kind of data with which he 
deals. The rather unsatisfactory state of this mathematical structure has 
been pointed out not only by those impatient with his approach, 52 but 
by those friendly to his point of view. 53 But the difficulties are inherent in 
the early stages of theory construction and need not prevent appreciation 
of Lewin s positive suggestions. 

His general conception of the relationship between mathematics and 
the data of a science is sound. There is no question that some forms of 
mathematics are more appropriate than others. The geometry of spaces 
does usefully represent alternative routes, conflict, ambivalence, barriers, 
and many other psychologically real situations. It is the incorporation of 

49 Lewin ( 1943), page 293. 

50 Lewin ( 1943 ), page 295. 

51 Estes (1954b), page 342. 

52 E.g., London (1944). 

63 E.g., Leeper ( 1943). 
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metrical concepts of distance, direction, and the like which causes the 
most trouble. The usual diagram also best represents the momentary life 
space. The conversion of this diagram into a third dimensional figure 
showing the changes in time — with which learning theories are commonly 
concerned — introduces some awkwardness. As in the case of Hull’s math- 
ematical model, a great deal of tinkering is possible in smoothing out 
the difficulties without destroying the essence of what it is trying to 
convey. 
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Chapter 9 

FREUD’S PSYCHODYNAMICS 


S IGMUND FREUD (1856-1939) so influenced psychological thinking 
that a summary of theoretical viewpoints, even in a specialized field 
such as the psychology of learning, is incomplete without reference to 
him. In the context of the history of psychology, Boring has this to say 
of Freud’s place: 

It was Freud who put the dynamic conception of psychology where psy- 
chologists could see it and take it. They took it, slowly and with hesitation, ac- 
cepting some principles while rejecting many of the trimmings. It is not likely 
that the history of psychology can be written in the next three centuries without 
mention of Freud's name and still claim to be a general history of psychology. 1 

It is no simple task to extract a theory of learning from Freud’s writ- 
ings, for while he was interested in individual development and the kind of 
re-education that goes on in psychotherapy, the problems whose answers 
he tried to formulate were not those with which theorists in the field of 
learning have been chiefly concerned. Psychoanalytic theory is too com- 
plex and, at least at the present time, too little formalized, for it to be 
presented as a set of propositions subject to experimental testing. There- 
fore, instead of attempting an orderly exposition of a psychodynamic 
theory of learning, we shall rest content with examining in somewhat 
piecemeal fashion suggestions from psychoanalysis that bear upon learn- 
ing. Some of the suggestions have already influenced experimentation in 
learning, and some of them may be more influential in the years ahead. 


PARALLELS BETWEEN PSYCHOANALYTIC THEORY 
AND CONVENTIONAL INTERPRETATIONS OF 
LEARNING 

One can find in Freud’s writings a number of statements that parallel 
very closely statements made by contemporary learning theorists little 
1 Boring ( 1950), page 707. 
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influenced by him. The common topics within psychoanalysis and learning 
theory serve usefully to show how relevant psychoanalysis is to learning. 

The Pleasure Principle and the Law of Effect 

Hedonistic theories — that man seeks pleasure and avoids pain — are 
among the oldest interpretations of human conduct, and any theory of 
learning must come to grips with the facts to which they refer. There is 
no doubt that we can control learning by way of reward and punishment. 
The details sometimes elude us, but the gross facts cannot be doubted. 

Freud’s pleasure principle is in accord with these facts, and his in- 
terpretation of the pleasure principle represents one of the first points 
of correspondence between his views and those of learning theorists. The 
corresponding principle in contemporary learning theory is more likely to 
go under the names of the law of effect or reinforcement theory . The broad 
conception, common to both psychoanalysis and learning theory, is that a 
need state is a state of high tension. Whether we describe this in terms 
of instincts seeking gratification or of drives leading to consummatory 
responses, we are talking about the same sequence of events. What con- 
trols the direction of movement is a tendency to restore a kind of equilib- 
rium, thus reducing tension. Freud talked about a return to a constant 
state, and the physiologists, after Cannon, refer to homeostasis. The prin- 
ciple goes back at least to Claude Bernard, but in any case it is shared 
by Freud and the learning theorists whose views have already been con- 
sidered. 

One of Freuds concise statements of this position is found in the 
introductory paragraph of his Beyond the pleasure principle : 

In the theory of psychoanalysis we have no hesitation in assuming that the 
course taken by mental events is automatically regulated by the pleasure prin- 
ciple. We believe, that is to say, that the course of those events is invariably set 
in motion by an unpleasurable tension, and that it takes a direction such that 
its final outcome coincides with a lowering of that tension — that is, with an 
avoidance of unpleasure or a production of pleasure.- 

Freud speculated about the relationship between pleasure and quan- 
tity of excitation. Unpleasure, he said, corresponded to an increase in 
excitation and pleasure to a diminution , although he allowed some impor- 
tance to the rate of change of excitation. In this he followed a classical 
psychologist, Fechner, who wrote along the same lines much earlier.® 

Until lately, American psychologists have preferred to talk about 
tension and its reduction without bringing in the term ‘‘pleasure.” More 

2 Freud (1920b), page 1. From Beyond the pleasure principle by Sigmund Freud, 
by permission of Liveright Publishers, New York. 

2 Fechner (1873). 
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recently, however, attention is again being paid to affectivity. 4 McClelland 
and his associates ( 1953) have proposed a quantitative theory of pleasant- 
ness and unpleasantness, somewhat along the lines suggested by Freud. 
They propose that a slight departure from the level of stimulation to which 
the organism is adapted is usually pleasurable, while an extreme departure 
is unpleasant or annoying. Thus if one adds salt to distilled water, small 
quantities make the taste more acceptable, but large quantities make it 
less acceptable. Freud suggested that this was a field open to experimenta- 
tion, and indeed it is. 5 

If we are to move to more sophisticated and refined theories of learn- 
ing, it is imperative that we pay attention to the differences between 
proposals as well as to their similarities. There is an important aspect of 
Freud’s tension-reduction theory less well represented in contemporary 
law-of-effect discussions. This is the basic principle that an aroused need 
that remains unsatisfied produces a fantasy of the goal-object that could 
satisfy the need. That is, tension reduction that is not accomplished 
through actual gratification is accomplished through wish fulfillment , in 
the form of a hallucination of the satisfying object. At the id level this 
fantasy is confused with reality, just as the thirsty man on the desert 
confuses the mirage with a true oasis. In the primary process , under the 
control of the pleasure principle, wish fulfillment is hallucinatory; the 
later, more realistic expectation of future events comes with the develop- 
ment of the secondary process , when memory images become substituted 
for hallucinations. 

The problem of hallucinations has not been entirely neglected by 
learning theorists. Mowrer (1938) proposed the study of hallucinations as a 
means of investigating certain aspects of expectancy. Dollard and Miller, 
treating perception as a response, and a hallucination as a learned per- 
ception in the presence of minimal or ambiguous cues, accept the fact that 
hallucinations are more likely if drives are strong. They cite evidence 
for their position. 6 

The first area of correspondence, then, between learning theories and 
psychoanalysis, is the family resemblance between the tension-reduction 
interpretation of the law of effect (reinforcement theory in one of its 
forms) and the pleasure principle, with the caution, however, that learning 

4 E.g,, Young (1952). 

5 Freud ( 1920b ) , page 2. 

6 Dollard and Miller (1950), pages 178-181. The evidence is from McClelland 
and Atkinson ( 1948) that images are influenced by drives, and from Holmberg ( 1950) 
that hallucinatory images in dreams are influenced by motivation. 
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theories have not fully incorporated the fantasy-production feature of the 
Freudian principle. 

The Reality Principle and Trial-and-Error Learning 

Freud’s first supplementation of the pleasure principle was by way of 
the reality principle , whereby the organism instead of insisting upon 
immediate gratification takes the “long indirect road to pleasure.” 7 Because 
the pleasure is found at the end of the path, we may think of the psychol- 
ogist’s rat, winding its way through a tortuous maze, eliminating the 
blind alleys along the way, as being under the control of a reality principle. 

There is some ambiguity here, for Freud’s reality principle is bound 
up with the developing ego, and it would be a big jump to assign the 
rat an ego because it avoids blind alleys on the way to food. 

Psychologists have been unclear in trying to find the correspondences 
between their principles and Freud’s reality principle. 

Dollard and Miller definitely relate the pleasure principle to the law 
of effect, but they have no direct counterpart to the reality principle. Ego 
strength (related, of course, to mastery of reality) they treat in the fa- 
miliar psychological terms of higher mental processes, learned drives and 
skills. Reality, they say, is elaborated according to the physical and social 
conditions of learning. 8 

Mowrer takes a different position. He believes that the law of effect 
is related to the pleasure principle, in agreement with Miller and Dollard, 
but he would relate trial-and-error problem solution to this principle 
also, rather than to the reality principle. He believes the reality prin- 
ciple to be appropriate instead to simple Pavlovian (classical) condition- 
ing, because he believes Pavlov’s dogs learn “not what is pleasurable and 
relieving, but what is actual, true, real.” 9 He goes on to infer that such 
conditioning is limited to the autonomic nervous system, and thus leads 
himself into what appears to be a most unusual position, namely, that 
reality learning is under the control of the autonomic system, whereas 
pleasure-pain learning is under the control of the centra] nervous system. 
To some it will seem more plausible to assign instinctive, impulsive func- 
tions to the autonomic system (though I would not go that far) and to 
assign reality-regulated functions, ego functions, to the cognitive appa- 
ratus: to the central nervous system and its related structures. 

7 Freud (1911). 

8 Dollard and Miller ( 1950 ) , pages 9-10. 

9 Mowrer ( 1950), pages 5-6. 



294 


THEORIES OF LEARNING 


Let us return to animal learning as possibly representing the reality 
principle because there are many events intervening between the aroused 
tension and its ultimate resolution. A white rat can learn the true path 
in a maze, and in a discrimination apparatus it can learn to jump to a 
triangle and not to a circle. Thus it learns permitted and unpermitted be- 
havior, much in the way a child learns to behave in accordance with the 
dictates of its culture as transmitted by those in authority over him. The 
rat can learn probabilities also. If rewarded more frequently on the right 
than on the left he learns to turn more often to the right, as though to 
equate the response-reinforcement ratio on both sides . 10 This conforming 
of behavior to reality, even to a contingent reality, surely has some resem- 
blance to the earliest ego functions. Learned discrimination always in- 
volves conflict, and the conflict is resolved through the weighting of the 
choices by reward and punishment. These weights become available 
through the accumulation of experiences with the environment. Some- 
times, when discrimination is difficult, the rat mobilizes his experience 
through what Muenzinger and Tolman call “vicarious trial and error” 
(VTE), that is, a sampling of both choices by looking back or forth, or by 
making short runs first to one side and then to the other before the ani- 
mal is committed to a choice. This appears to be a kind of symbolic 
reality-testing, an analogue of ego-type behavior. 

Academic psychologists and psychoanalysts spend a good deal of their 
time studying the approach to and solution of reality problems. We may 
conclude that there is some gross correspondence in topic between what 
Freud calls the reality principle and the studies in which behavior is 
regulated largely by experienced success and failure. What is lacking 
in the co-ordination is a clear ego psychology in academic psychology to 
be co-ordinated with the ego theory in psychoanalysis. The ego psychology 
of psychoanalysis is itself only now undergoing clarification . 11 

Repetition-Compulsion in Relation to Theories 
of Habit Strength 

In his Beyond the pleasure principle , Freud introduced a number of 
highly speculative notions, with awareness that he was trying out ideas 
to which he did not wish fully to commit himself. One of the suggestions 
most closely bound to evidence from his clinical experience was that of 

10 E.g., Brunswik ( 1939). 

11 For developments within psychoanalytic ego-theory, see Erikson ( 1950), A. Freud 
(1936), H. Hartmann ( 1950), Loewenstein ( 1954), 
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a repetition-compulsion . These repetition-compulsions are believed to be 
beyond the pleasure principle, for Freud says: 

The greater part of what is re-experienced under the compulsion to repeat 
must cause the ego unpleasure, since it brings to light activities of repressed in- 
stinctual impulses. . . . The compulsion to repeat also recalls from the past 
experiences which include no possibility of pleasure, and which can never, even 
long ago, have brought satisfaction even to instinctual impulses which have 
since been repressed. 12 

Freud recognized that sometimes repeating unpleasant activities 
yields pleasure, possibly through the sense of mastery that is achieved. 
These cases he believed to be genuine, but they are still under the 
control of the pleasure principle, and are not what Freud meant by acts 
not under the control of the pleasure principle. He was not talking about 
secondary or derived pleasure; he referred to behavior that he believed to 
be independent of, and disregarding, the pleasure principle. 

In the effort to understand these repetition-compulsions, especially 
as they appeared in traumatic neuroses, Freud was led to posit a destruc- 
tive instinct (“death instinct") existing alongside of Eros (the “life in- 
stinct,” comprising self-preservation, self-love, and object-love). This is 
one of the more controversial topics within psychoanalysis itself, and some 
relatively orthodox analysts find Freuds formulation unsatisfactory. 13 

This is not the place to enter into a detailed critique of psychoanalytic 
theory, and in relating the work of academic psychologists to Freudian 
theory we must be satisfied to review some of the points of correspond- 
ence. In general, academic psychologists who have discussed repetition- 
compulsion have confined themselves to the kinds of behavior resistant to 
the usual principles of extinction. Mowrer, for example, notes the parallel 
between Hulls interest in acts that do not extinguish and the repetition- 
compulsion, and he sees a relation between repetition-compulsion and 
functional autonomy. 14 Dollard and Miller indicate that the repetition- 
compulsion is a result of cues provided by emotions that are themselves 
aroused by meaningless shreds of memories that recur in free association. 
Instrumental acts (the tendencies to repeat) are conditioned to these 
cues. 15 

Learning theorists have offered three suggestions to explain acts un- 
usually resistant to extinction: 

12 Freud ( 1920b), page 21. From Beyond the pleasure principle by Sigmund Freud, 
by permission of Liveright Publishers, New York, 
is E.g., Fenichel ( 1945), pages 59-61. 

14 Mowrer (1950), pages 9, 112 
*5 Dollard and Miller (1950), page 256. 



296 


THEORIES OF LEARNING 


1. All compulsions may conform to ordinary principles of tension- 
reduction learning. They may involve subtle forms of anxiety reduction, 
as in the defense mechanism known as “undoing.” It is admitted that 
available data are insufficient to demonstrate that this explanation holds 
in all cases. 16 

2. Overlearned activities may be resistant to change, hence lead to 
excessive repetition. Rats trained to jump a gap in order to be rewarded, 
if they repeat the activity often enough, will fail to take a direct path 
when one is provided. 17 A kind of “canalization” takes place, so that the 
familiar is preferred and the behavior remains stereotyped. 18 This has a 
family resemblance to compulsive behavior, but because the behavior was 
once a source of gratification it does not fit Freuds specifications for 
repetition-compulsion. 

3. Behavior acquired under excessive frustration may become “abnor- 
mally fixated,” and thus resistant to change. This suggestion is elaborated 
by Maier (1949) on the basis of extensive experiments conducted by him 
and his students and collaborators. If rats in the familiar Lashley jumping 
apparatus are rewarded for jumping to a cross and punished for jumping 
to a circle, they soon learn to jump to the correct figure, whether it is 
presented on the right or on the left, and the habit is flexible and can be 
reversed if reward and punishment are reversed. If, however, they are 
given an insoluble problem, with reward now on the right, now on the 
left, with no systematic relationship between the cards in the windows 
and the rewards and punishments, they develop highly stereotyped be- 
havior, such as jumping to the left, no matter what happens. The response 
occasionally persists for hundreds of trials, even though every trial is pun- 
ished. It appears that the response is no longer under control of the reality 
principle. Accessory evidence shows that the discrimination between the 
cards may be learned without affecting the fixated response. Even when 
the opposite window is left open with the food clearly in view, the rat, 
fixated on the left, will continue to jump to the left, bump its nose, and 
fall into the net. This is the nearest we come to an experimental analogue 
of compulsive behavior. 

The experimental problem has been reopened by Wilcoxon (1952), 
who repeated Maier ’s experiments with a number of controls. He found 
the main cause of the excessive fixation to be the the learning of an act 
that was occasionally punished, occasionally rewarded, when the alter- 

16 Mowrer (1950), page 112. 

17 Gilhousen ( 1931). 

18 The possible significance of “canalization” has been treated by Murphy (1947), 
pages 161-191. 
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native was invariably punished. While he interpreted his results as con- 
forming to ordinary principles of associative learning, the causal factors 
are complex. 

Compulsive behavior is a profitable field for further investigation, 
apart from trying to test in detail Freud’s hypotheses. Fenichel lists three 
types of compulsion . 10 The first type rests on the periodicity of drive 
states (instincts), so that derivatives of these states may be expected also 
to occur periodically. He believes that manic-depressive phenomena fit 
this type. The second type is a repetition due to the tendency of the re- 
pressed to find an outlet In the so-called ‘neuroses of destiny” the patient 
periodically evokes or endures the same experience. The symptoms are 
sometimes set off by “anniversaries” of significant events . 20 Compulsive 
rituals, such as compulsive counting, are said by Fenichel to fit this type. 
The third type consists in the repetition of traumatic events for the pur- 
pose of achieving a belated mastery. Children who have been the passive 
victims of frustration may seek in their games an active repetition of the 
event at a time of their own choosing, in order to gain comprehension and 
mastery. The dreams and repeated symptoms in traumatic neurosis fit 
this pattern. 

It is evident that there are problems here that have not been fully 
represented in learning experiments, although again there is overlap in 
topic, that it, between repetition-compulsion and overlearning, abnormal 
fixation, and resistance to extinction. 

These three principles from Freud (the pleasure principle, the reality 
principle, and repetition-compulsion) have been selected to show that 
there are at least some parallels between Freudian theory and the dis- 
cussions by learning theorists. It would no doubt prove fruitful to try to 
achieve greater clarity with respect to these parallels, for the interrela- 
tionships are by no means obvious and direct. 

PSYCHOANALYTIC CONCEPTIONS THAT HAVE 
INFLUENCED LEARNING EXPERIMENTS 
AND THEORIES 

In the foregoing account of parallels between Freudian theory and 
the work of academic psychologists it should be evident that the two 
lines developed rather independently. That is, studies of the law of effect, 
of overlearning, and of problem-solving were not undertaken to test 

« Fenichel (1945), pages 542-543. 

20 E.g., J. R. Hilgard ( 1953). 
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Freudian theory, and Freudian theory was not developed in order to 
"‘explain” the results obtained by experimental psychologists. There have 
been a number of topics within Freudian theory, however, that have 
definitely set problems for the experimental psychologist. We now turn 
to a review of some of these. 

Anxiety as a Drive 

Freud's views about anxiety evolved along with other aspects of his 
theory. 21 At first he thought that attacks of anxiety in his patients gave 
evidence of repressed instinctive (libidinal) excitation. That is, repressed 
libido, he thought, was transformed into anxiety. 22 Later, as he developed 
the theory of id, ego, and superego, he assigned anxiety production to the 
ego. When the ego perceives danger this perception arouses anxiety, and 
then steps are taken to reduce anxiety. Symptom formation may be one 
of these steps. Repression may also be one of the devices adopted by the 
ego, so that, instead of repression causing anxiety, we now have anxiety 
causing repression. 23 

Anxiety is closely related to fear, though fear usually has a real ob- 
ject, while anxiety may be a vague apprehension of unknown danger. 
Three kinds of anxiety can be distinguished, of which the first is indistin- 
guishable from fear: 

1. Objective anxiety (also called real or true anxiety) depends upon 
real or anticipated danger whose source lies in the external world. To be 
afraid of a poisonous snake or a holdup man is to be anxious in this way. 
True anxiety implies a real known danger. 

2. Neurotic anxiety is in regard to an unknown danger. Upon analysis 
it is found that the danger is, as Freud put it, an instinctual one. That is, 
a person is afraid of being overpowered by some impulse or thought that 
will prove harmful to him. Sometimes there is a real or threatened danger 
but the reaction to it is excessive, thus revealing the neurotic element in 
the anxiety. 24 

3. Moral anxiety is aroused by a perception of danger from the con- 
science (superego). The fear is that of being punished (belittled, de- 
graded) for doing or thinking something that is contrary to the ego ideal. 
Moral anxiety is experienced as feelings of guilt or shamer 5 

21 A useful account of the changes in Freud's conceptions is given by R. May ( 1950), 
pages 112-127. 

22 Freud ( 1920a), pages 347-349. 

22 The new views can be found in Freud ( 1926). 

24 For the distinction between these first two types of anxiety, see Freud (1926), 
pages 112-117. 

25 For a discussion of the three kinds of anxiety, see Hall ( 1954), pages 59-69. On 
shame and guilt, see Piers and Singer ( 1953). 
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Anxiety in later life was said to have two modes of origin; one, in- 
voluntary and automatic, whenever a danger situation arose, the second 
“produced by the ego when such a situation merely threatened, in order 
to procure its avoidance.” 26 

Mowrer (1939) was the first of the experimenters on learning to see 
the possibility of studying an analogue of anxiety in experiments on avoid- 
ance conditioning. He proceeded to show experimentally how rats that 
had experienced pain in the presence of certain cues could be led to make 
avoidance responses in the presence of those cues, that is, when the pain 
was merely threatened. 27 Thus he believed the anxiety-fear response to be 
the conditioned form of the pain response. Just as pain can be used as a 
drive to produce learning through escape, so anxiety can be used to pro- 
duce learning. The anxiety can be reduced by appropriate avoidance re- 
sponses just as pain can be reduced by escape responses. Hence these 
anxiety-reducing responses are learned by the ordinary reinforcement 
principle of tension reduction. Conceivably defense reactions or neurotic 
symptoms might be acquired through this familiar learning pattern. This 
kind of acquired fear or anxiety has been extensively studied as a drive 
in animal learning, especially by N. E. Miller. 28 

Mowrers views changed somewhat as he began to consider those 
features of human behavior less represented in rat behavior, especially 
man’s capacity to consider the future consequences of his acts. 29 He gradu- 
ally arrived at his own position on anxiety, at variance with Freudian 
theory. He developed what he called a “guilt theory” as contrasted with 
Freud’s “impulse theory.” 30 

The problems of anxiety, shame, and guilt lead far afield from present- 
day learning theory. The main point of contact at present is in the treat- 
ment of acquired fear as a drive. The importance of fear as a drive is thus 
summarized by Dollard and Miller: 

One of the most important drives of all is fear, or “anxiety” as it is often 
called when its source is vague or obscured by repression (Freud, 1926). There 
are three main reasons why fear is so important: because it can be so strong, 
because it can be attached to new cues so easily by learning, and because it is 
the motivation that produces the inhibiting responses in most conflicts. 31 

A somewhat different, yet related, use of the concept of anxiety in 
learning experiments was introduced by Taylor (1951), when she classi- 

26 Freud (1926), page 109. 

27 Mowrer ( 1940b ) . 

28 Miller (1948a). For a review of other related studies see Miller (1951b), Farber 
(1954). 

29 Mowrer and Ullinan ( 1945 ) . 

30 Mowrer ( 1950 ) , page 5*57. 

31 Dollard and Miller ( 1950), page 190. 
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fied human subjects in an investigation of eyelid conditioning according 
to their scores on a verbal "anxiety scale” derived from the Minnesota 
Multiphasic Personality Inventory. She found that the more anxious sub- 
jects conditioned more rapidly than the less anxious ones, and interpreted 
anxiety as drive — as the multiplier D in Hulls theory. A number of ad- 
ditional experimenters have used her scale in testing various hypotheses 
about the relationship between anxiety and conditioning. 32 

Our review shows us that in the course of experimentation theories 
of anxiety have diverged from the testing of Freud s interpretations. But 
the original introduction of anxiety concepts into learning theory was by 
those who were interested in Freudian theory, and they indicated in- 
debtedness to Freud’s writings. 

Unconscious Influences upon Word Associations 

The free-association technique of psychoanalysis has found its repre- 
sentation in psychological experimentation chiefly through the diagnostic 
word-association test introduced by Jung. Many experiments have been 
done, acknowledging the importance of the emotional processes behind 
word associations, leading quantitatively to the recognition of a number 
of "complex indicators.” 33 Although these have not had much influence 
within learning experiments, lawful relations can be found, as between 
the frequency of response words and the reaction times with which they 
are given. 34 

Word recognition, which is also a form of associative reaction to the 
printed word, has been studied in relation to the affective significance 
of the words. These studies, often referred to as studies of perceptual 
defense, are reviewed later in this chapter. 

Repression, Forgetting, and Recall 

Ever since Ebbinghaus, psychologists have made quantitative study 
of memorization and recall part of their laboratory practice. It was natu- 
ral, therefore, that Freudian interpretations of memory lapses should have 
been seized upon as appropriate for experimental testing. At first, psy- 
chologists grasped only the forgetting aspect of repression, failing to take 
into account the motivational aspects. This was due, in part, to the re- 

32 Bindra, Paterson, and Strzelecki (1955), Bitterman and Holtzmann (1952), Far- 
ber and Spence (1953), Hilgard, Jones, and Kaplan (1951), Montague (1953), Ra- 
mond (1953), Spence and Farber (1953) (1954), Spence and Taylor (1951) (1953), 
Taylor and Spence (1952), For a valuable review of both experiment and theory, see 
Farber ( 1954). The scale itself is described in Taylor ( 1953). 

33 E.g., Hull and Lugoff ( 1921 ) . 

34 Scnlosberg and Heineman ( 1950). 
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jection of the psychoanalytic theory of instincts, in part to the absence of 
motivational concepts in prevailing studies of memory and forgetting . 35 

Freud distinguished between a primal repression and a second phase 
called repression proper . 36 The primal repression consists in “a denial of 
entry into consciousness to the mental (ideational) presentation of the 
instinct.” The ideational content then remains unaltered and the instinct 
remains attached to it. “The second phase of repression, repression proper , 
concerns mental derivatives of the repressed instinct-presentation, or such 
trains of thought as, originating elsewhere, have come into associative con- 
nection with it.” The tendency to ward off these secondary derivatives 
is sometimes called after-expulsion. 

It is to be noted that repression proper assumes a primal repression, 
so that the pattern for all later repressions is set up early in life. Later 
events may be assimilated to these earlier ones. The activity of repression 
is not, however, over at the time something is repressed. 

The process of repression is not to be regarded as something which takes 
place once and for all, the results of which are permanent, as when some living 
thing has been killed and from that time onward is dead; on the contrary, re- 
pression demands a constant expenditure of energy, and if this were discon- 
tinued the success of the repression would be jeopardized, so that a fresh act 
of repression would be necessary. 37 

Sears has summarized the characteristics of a repressed instinctual 
impulse, according to Freud, in six statements: I;s 

1. It is not represented in its true form in consciousness. 

2. The instinct-presentation develops in a more luxuriant fashion than it 
would if it were conscious. 

3. The resistance of consciousness against derivatives and associations of 
the instinct-presentation varies in inverse proportion to their remoteness from 
the ideas originally repressed. 

4. Repression is highly specific to each idea and substitute idea. 

5. Repression is very mobile. 

6. The degree of repression varies with the strength of the instinctual im- 
pulse. 

The feature of repression that has appealed most to psychologists is 
its relation to pleasure and pain. Freud says, for example, that in repres- 
sion the avoidance of pain must have acquired more strength than the 
pleasure of gratification . 39 Psychologists have often interpreted repression 

35 These points were made by Sears ( 1936a ) in an able review of Freud’s theory of 
repression and the possibilities within experimentation. 

33 Freud ( 1915a), page 86. 

37 Freud (1915a), page 89. 

38 Sears (1936a), pages 242-243, based on Freud ( 1915a). 

39 Freud (1915a), page 85. The line of development began much earlier. 
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quite superficially to mean that pleasurable events will be remembered 
and distasteful ones forgotten. The first experiments used merely affec- 
tively pleasant or unpleasant materials, such as pleasant and unpleasant 
odors, words such as ‘sugar" (pleasant) and ‘quinine" (unpleasant), 
studying whether or not such materials could be easily memorized or 
retained. 40 Later studies have sought in one way or another to meet more 
squarely the demands of psychoanalytic theory. We shall return later to a 
consideration of some of them. 

Fixation 

Fixation has two closely related meanings in psychoanalytic theory. 
The first meaning is that of arrested development , so that an adult may 
be fixated at an infantile or adolescent level of psychological functioning. 
This usually implies an object-choice (the mother; a like-sexed person) ap- 
propriate to the level of fixation, and the statement is then made that 
the person is fixated at such-and-such level and fixated upon such-and- 
such an object. The second meaning is that of fixed habits leading to pre- 
ferred modes of solving personal problems, such as a fixation upon a 
particular mechanism of defense. The habit fixations discussed as forms 
of compulsion (pages 294-297) are fixations of this second kind. The two 
meanings are not clearly distinguished in psychoanalytic theory because 
the stage of arrested development is so intimately related to the style of 
life. That is, habitual modes of reacting as an adult are likely to be de- 
scribed in developmental terms, for example, the “anal character,"" the 
“oral character."" 41 

In his review of objective studies bearing on psychoanalytic con- 
cepts, Sears treated fixation as habit strength, and listed a dozen circum- 
stances shown experimentally to modify the strength of instrumental acts. 42 
While aware of the problem, he failed to treat in any detail the qualitative 
features of transition from one stage of development to the next. The 
transition depends not only on the strength of the earlier fixation but on 
the hazards involved in transition to the next stage. If the next stage is 
attractive and non-threatening, presumably the growing individual will 
enter upon it even though he showed strong attachments at the earlier 
stage. If some residue from the earlier stage makes entrance upon the 
later stage painful and anxiety-producing, arrested development may oc- 

40 E.g., Gordon (1905) (1925), Tait (1913), Tolman (1917). 

41 Fenichel (1945), page 523, writes that neurotics “are not only fixated to certain 
levels of instinctive demands but also to certain mechanisms of defense.” He also in- 
cludes “character attitudes” among the items to which they are fixated. 

Sears ( 1943), pages 81-89. 
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cur. Habits at the different levels are not all-or-nothing affairs, foi an 
adult who is grown up in some habits may be infantile in others. Ultimately 
we require a quantitative theory to account for arrested development in 
some aspects of personality and progressive development in others. 

Regression 

Regression is related to fixation in that when an act is blocked or 
frustrated, some substitute will occur. The substitute is quite likely to be 
an act once strongly fixated in the individuals repertory. Such a substitu- 
tion of an earlier habit for a contemporary one represents one kind of 
regression. 

We may distinguish three kinds of regression: 

1. Instrumental-act regression . When the organism is prevented from 
using one habit, an earlier learned habit is substituted. Many animal ex- 
periments have demonstrated this to occur. 43 These results provide an 
analogy to object-regression in Freudian theory, the kind of regression in 
which gratification is gained by relinquishing a present object and return- 
ing to an earlier one. 

2. Age-regression. Under some circumstances the person returns to 
earlier modes of behavior and obliterates future events, in a kind of re- 
vivification. If, for example, an adult who regressed under hypnosis to 
an early age began to speak in a language used in childhood but no longer 
available to him in his adult life, we could say that he had regressed 
(at least in this respect) to an earlier period in his own life. Experiments 
under hypnosis have led to somewhat ambiguous results. Occasionally 
the functioning at the suggested (regressed) age resembles very closely 
the known historical circumstances in the person’s life. 45 When regressed 
subjects take intelligence and personality tests, their performances differ 
from those of actual children at the age levels represented by the sug- 
gested regression. Hence there appears to be a certain amount of role- 
playing going on. 46 

3. Primitivation . Even though the regressed individual may not return 
either to instrumental acts once in his inventory of habits or to personality 
functioning characteristic of himself at some previous period, he may, 
under stress, show a kind of behavioral disorganization that can be ehar- 

E.g., Hamilton and Krechevsky (1933), Mowrer (1940a), O’Kelly (1940a) 
(1940b). Sears (1943), pages 89-96, has reviewed this literature. 

** The term is used by Weitzenhoffer (1953), page 191, as one classification of 
regressive manifestations under hypnosis. 

« Erickson ( 1937 ), Gill (1948), Kupfer ( 1945 ) . 

46 For specimen studies with intelligence tests, see Kline (1950), Sarbin (1950). 
For studies with personality tests, see Sarbin and Farberow ( 1952 ), Orae ( 1951 ). 
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acterized as more primitive. This kind of regression has been studied with 
young children subjected to mild frustration. 47 They showed regression 
by such indices as reduced constructiveness in play following the ex- 
perienced frustration. 

Psychoanalysts suggest that some psychotic manifestations are regres- 
sive, particularly schizophrenia. DuBois and Forbes (1934) found little 
return to the fetal posture during sleep. Cameron (1938a) (1938b) failed 
to find childish thought patterns in schizophrenic speech. Sears points 
out, however, that these studies are inadequate tests of regression because 
they do not bear adequately on the possible affective indicators of regres- 
sion. 48 


Aggression and Its Displacement 

In his later years Freud became increasingly aware of aggression, 
hostility, and destructiveness. In discussing racial tensions and other inter- 
group antagonisms he said: 

We do not know why such sensitiveness should have been directed to 
just these details of differentiation; but it is unmistakable that in this whole 
connection men give evidence of a readiness for hatred, an aggressiveness, the 
source of which is unknown, and to which one is tempted to ascribe an ele- 
mentary character. 49 

Freud recognized the importance of giving expression to hostility, 
rather than holding it back, and many contemporary analysts now give 
prominence to the role of repressed hostile impulses in neurosis. 

While Freud thus paid increasing attention to aggressiveness in his 
later writings, as something inherent in man, he had earlier suggested 
that interference with instinctual satisfaction leads to a hostile attack 
upon the source of the frustration. Symptoms of neurotic illness may be 
so directed as to cause distress to someone in the environment who is 
perceived as the agent of frustration. Building upon these suggestions in 
Freud (1917), Dollard and his collaborators formulated the frustration- 
aggression hypothesis in a manner to make possible quantitative testing. 50 
The general principle stated was that frustration leads to aggressive ac- 
tion; it was necessary later to correct the implication that aggression was 
the only (or even an inevitable) consequence. 51 The experimental evi- 

47 Barker, Dembo, and Lewin (1941). 

48 Sears ( 1944), page 315. 

49 Freud ( 1921 ), page 56, This “elementary character” he had assigned a year before 
to a “death instinct.” 

00 Dollard, Doob, Miller, Mowrer, Sears, Ford, Hovland and Sollenberger ( 1949 ) . 

51 Miller (1941), Sears (1941), These two papers are part of a symposium on the 
effects of frustration. The other papers are Rosenzweig (1941), Bateson (1941), Levy 
( 1941 ), G. W. Hartmann ( 1941 ), and Maslow ( 1941 ). 
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dence gave abundant support to the hypothesized linkage between ex- 
perienced frustration and subsequent aggressive, hostile, or destructive 
behavior. 

If the agent responsible for frustration is unknown or inaccessible 
as an object of attack (absent or protected through conflicting response 
tendencies), another object will be chosen. This is called displaced ag- 
gression. The object may be as innocent as the scapegoat which in the 
ancient Hebrew ritual was made to bear the nations sins/ 2 The word 
"scapegoat” has come into general use to refer to a victim of displaced 
aggression. 

Neal Miller and his collaborators have succeeded in relating dis- 
placement to stimulus generalization in learning theory. 53 In his original 
experiment, Miller taught rats to strike each other, as in fighting, when 
an electric shock came on. When a second rat was not present, a rat would 
"displace” the aggression to a celluloid doll or other object in the environ- 
ment. The gain to be expected from Miller’s experimental arrangement 
is that certain quantitative predictions can be made about the occasions 
for displacement, the intensity of the reaction, and so on. Miller formu- 
lated eight specific deductions from his theory, of which the following 
three are illustrative: 

When the direct response to the original stimulus is prevented by the 
absence of that stimulus, displaced responses will occur to other similar stimuli 
and the strongest displaced response will occur to the most similar stimulus 
present. 

When the direct response to the original stimulus is prevented by conflict, 
the strongest displaced response will occur to stimuli which have an intermediate 
degree of similarity to the original one. 

If the strength of the drive motivating the direct response to the original 
stimulus is increased, it will be possible for increasingly dissimilar stimuli to 
elicit displaced responses. 54 

While Miller’s experiments on displacement have been with animal 
subjects, he shows how the principles suggested may apply to human 
behavior. In human beings, discriminations tend to take the form of words, 
so that we narrow our meanings and reduce our generalizations. The 
reverse holds true also, and we use words to generalize, to name classes 
of objects that belong together. At this point, however, we are interested 
in discrimination and how it breaks down. Suppose that Mr. Brown dis- 
tinguishes among his new neighbors by calling them all by name. Because 
of a painful snub, he becomes very angry at Mr, Jones, one of these neigh- 

52 The Old Testament account of the scapegoat is in Leviticus 16:22. 

53 Miller (1948b), Miller and Kraeling (1952), Miller and Murray (1952), Murray 
and Miller ( 1952). 

54 The three statements are quoted from Miller (1948b), pages 168, 170. 
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bors. Because Mr. Jones’s social position is more secure than Mr. Browns, 
and Mr. Brown is new to the neighborhood, Mr. Brown may repress his 
anger. (This fits the suggestion that the expression of hostility may be 
prevented by conflicting tendencies. ) In Miller’s interpretation, what he 
represses is the sentence: "I am angry at Mr. Jones.” Because this sentence 
is repressed, the anger is no longer tied directly to Mr. Jones, and it is 
more likely to generalize to another neighbor because discrimination 
through names has broken down. 

Miller recognizes that the displacement problem is actually more com- 
plex than this. He points out that the name of Mr. Jones is not repressed 
when it is not in a sentence referring to anger. So Mr. Brown recognizes 
the neighbor’s dog and says to himself: “This is Mr. Jones’s dog.” Because 
the name of Mr. Jones is attached unconsciously to hostile tendencies, he 
may give the dog a swift kick, which he would not do if he recalled that 
he was really angry at Mr. Jones, not at his dog. Thus Miller has deduced 
some functional differences between consciously and unconsciously de- 
termined behavior. He has restated his deduction succinctly: 

To summarize, the repression of verbal responses specifying the source of 
the aggression may remove a basis for discrimination and allow the illogical 
generalization, or displacement, of that aggression to be mediated by a different 
verbal response which is not repressed. 55 

Some closely related observations arise out of the study of doll play 
by young children. When attention is paid to aggressive behavior (fantasy 
aggression, in this case), the choice of object follows some of the principles 
of conflict outlined by Miller. That is, where anxiety is high, the object 
of aggression is chosen as one less similar to the person who is the source 
of the aggression than when anxiety is low. Consequently, as children play 
repeatedly in a permissive environment, their choice of object gradually 
shifts toward the parents, who have commonly been the authority figures 
responsible for some of their frustration. 56 

We may conclude this section by pointing out that on these topics 
( anxiety, affectivity and verbal responses, repression and recall, fixation 
and regression, aggression and displacement) the theories of Freud pro- 
vided a stimulus to research. At the empirical level this has led to a fruit- 
ful interaction between psychoanalysis and experimental psychology. 

55 Miller ( 1948b), page 176. 

56 P. S. Sears (1951;, R. R. Sears (1951), R. R. Sears, Whiting, Nowlis, and 
P. S. Sears (1953). 
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SUGGESTIONS FROM PSYCHOANALYSIS LITTLE 
REPRESENTED IN PYCHOLOGICAL STUDIES 
OF LEARNING 

One has the feeling that a good deal of rich material from psycho- 
analysis has eluded the academic psychologist. This, if true, might be due 
either to a certain obtuseness on his part, or because the richer material 
is too difficult to bring under experimental control. The problem has been 
discussed repeatedly. Thus Sears at one time came to the conclusion: 

It seems doubtful whether the sheer testing of psychoanalytic theory is an 
appropriate task for experimental psychology. Its general method is estimable 
but its available techniques are clumsy. 57 

Kris, in a very thoughtful discussion of techniques of verification of 
psychoanalytic propositions, felt that a limitation of the laboratory was 
that the laboratory cannot produce the real dangers or deal with the basic 
needs which the genetic propositions of psychoanalysis encompass: 

The limitations of the laboratory to quasi needs (and quasi dangers) seri- 
ously restrict the area of propositions that can be experimentally verified. In 
fact," up to the present, experimental approaches have been more successful in 
dealing with propositions concerning substitution :,K than they have been with 
propositions concerning repression. 09 

Kubie, while eager to see research on psychoanalysis move forward, 
and even outlining a research institute in psychoanalytic psychology, de- 
plores a certain triviality in many of the psychologists’ experiments on 
psychoanalytic phenomena. 

It is important that any experimentalist should first make himself thoroughly 
familiar with phenomena as these occur in nature, ascertaining what can be 
proved with the unaided eye and ear before deciding what to subject to ex- 
perimental verification. . . . Experimental facilities should not be wasted on 
issues which are already clearly proved, and to which human bias alone con- 
tinues to blind us. The experimentalist should rather take up where the natural- 
ist leaves off. 00 

Learning as Related to Stages of Development 

Writers on psychoanalysis often stress that it is a genetic as well as a 
dynamic theory , 61 That is, continuities in the life of the individual must 
always be taken into account along with what is happening in the present 

57 R. R. Sears ( 1944), page 329. The antecedent of “its” is experimental psychology, 

Henle ( 1942b) is referred to here by Kris. 

59 Kris ( 1947), page 255, 

60 Kubie ( 1952), pages 64-65. 

61 H. Hartmann and Kris ( 1945). 
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At many points the theory suggests that the very young child is unusually 
susceptible to influences which leave a permanent mark upon his per- 
sonality, so that, for example, dreams throughout his life may be influenced 
by these earliest learnings. If this interpretation is true, it is important 
for a general psychology of learning, and the evidence needs to be 
specified. 

While the problems have not been formulated clearly in terms of 
learning (i.e., why and in what way the results of childhood learning are 
more permanent than later learning) we are beginning to get kinds of 
evidence supporting the gross facts of adult consequences of early child- 
hood experiences. 

Among the experiments with animals, the best known are the ex- 
periments of Hunt ( 1941 ) on the effects of infant feeding frustration on 
adult hoarding in rats, and of Wolf ( 1943 ) on the effects of sensory depri- 
vation in early life upon types of functioning under stress in later life, 
again with rats. While the results of these experiments are subject to some 
reservations, they are relevant to the genetic theory that early experiences 
may show their influences in the conflicts of later life. 

Some conjectures about child-training practices and personality in 
adult life have been put to the test by Whiting and Child (1953). They 
went to the cross-cultural files, where anthropological reports on upwards 
of 200 cultures are summarized according to a great many categories. 
Then they asked certain questions about childhood practices and de- 
veloped hypotheses about their consequences for adult experiences in 
those cultures. To take one illustration, they predict that socialization 
anxiety established in early childhood should affect the interpretation of 
the causes of illness in adult life. Even in our own culture, with the em- 
phasis upon the germ theory of disease, we have many subordinate in- 
terpretations of illness: "It must have been something I ate,” “I’ve been 
working too hard,” “You have to suffer for your sins.” Our remedies, too, 
show something of the magical quality: indiscriminate prescriptions of 
cure by rest, by liquids, by bland foods, by cathartics, by play, by sun- 
tanning, by religious observances. If a “scientific” culture such as ours has 
all these magical residues, it is not too much to expect that primitive inter- 
pretations of illness might be related to deep-seated anxieties. 

Whiting and Child specified five kinds of anxiety and developed 
criteria for rating these. These five were: oral, anal, sexual, dependence, 
and aggression. The assumption is that severity of training and severity 
of punishment for infraction in these areas, or failures of gratification, 
would lead to anxiety. A culture might be severe in one of these areas 
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and permissive in the others. In addition, Whiting and Child classified 
primitive interpretations of illness as derivatives of these anxieties. 

When the childhood experience was correlated with adult interpreta- 
tions of illness, some striking correspondences turned up. For example, 46 
cultures could be classified with respect to degree of oral anxiety in child- 
hood, and interpretations of illness with oral aspects, such as food poison- 
ing. Of 20 cultures high in producing oral anxiety in children, 17 used oral 
interpretations of illness. Correspondingly, of 19 cultures low in oral 
anxiety, only 6 used oral interpretations of illness. This kind of evidence 
does not prove a relationship between childhood anxiety and adult be- 
havior, but it makes such a relationship plausible. 

Another important suggestion from psychoanalysis is that of child- 
hood amnesia. In later life we appear unable to recall events that must 
surely have impressed us at the time, and about which we must have 
talked. Because these events become secondarily linked with repression 
sequences, they too are repressed. Memories that are recalled are often 
trivial, and perhaps distorted, the so-called “screen-memories,” probably 
serving some of the purposes of repression. This whole field has been 
touched hardly at all by academic psychologists, who ought to be able 
to explore it with children, and not have to depend upon the retrospective 
accounts of adult patients/ 2 

A further aspect of the developmental sequence posited by psycho- 
analysis, and relevant for learning, is the latency period , for it has been 
related to readiness for schooling (at about the age of 6) when the oedipal 
problems are temporarily solved and the ego is ready to feast its curiosity 
about the external world. After a few years adolescence produces a new 
threat, and learning may again be disrupted. 

Most of the attempts to study the latency period by non-psvcho- 
analysts have raised serious doubts about the existence of such a period. 
Blum, in reviewing the evidence up to 1952, reports that “there has been a 
growing trend, both within and outside psychoanalytic circles, to doubt 
the existence of sexual latency.” 63 One very interesting experimental study 
has since appeared, giving results consonant with an orthodox interpreta- 
tion of latency/ 4 The developmental assumption is that castration anxiety 
will be high early (before the age of 7) and late (as adolescent develop- 
ment begins in the teen years). Using the method of incomplete fables, 

p or a discussion from the psychoanalyst’s viewpoint, see A. Freud (1935), pages 
9-37, What data there were on the problem at the time were reviewed by G, J. Duaycha 
and M. M. Dudvcha ( 1941 ) . 

Blum (1953), pages 133-134. 

64 Friedman ( 1952). 
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Friedman found that between the ages of 7 to 12 his subjects more freely 
than younger or older children ended the stories with a strongly suggested 
dismemberment ending, showing reduced castration anxiety during this 
period. A supplementary finding was that latency by this criterion ends 
earlier for girls than for boys, a result that is plausible in view of the 
earlier onset of puberty among girls. Further material having to do with 
the sexual attachments of daughters for their fathers showed that from 
the ages of 7 to 11 the girls differed little from the boys, while from 11 
onwards there was a definite increase in the kinds of fantasies interpreted 
as sexual feelings toward the father. A decline was possibly setting in 
at age 16. 

While our topic is learning, not personality development, if the genetic 
theory of psychoanalysis contains any truth, learning will show dif- 
ferences from one level to another. Hence developmental stages are of 
possible significance for learning. 

Obstacles to Learning 

In case histories from child-guidance clinics we find many illustra- 
tions of obstacles to learning based on the personal history of the learner. 
The teacher as a parent figure may arouse false expectations or reinstate 
continuing battles; some symbols used in teaching may be so freighted 
with personal meanings as to be defended against; conflicts over authority 
may result in non-reading or in spelling handicaps, for no subject matter 
is clothed more with arbitrary authority than English pronunciation and 
spelling. 65 

There is little recognition within the learning laboratory of these 
matters so important for the kind of learning that takes place in the social 
environment of the home and school. At least one skilled teacher, Na- 
thaniel Cantor (1946), has recognized that social learning at the college 
level is influenced by defense mechanisms. In his book on the dynamics of 
learning he devotes chapters to the roles of resistance, ambivalence, pro- 
jection, and identification. 

Psychodynamics of Thinking 

The possibility of drawing upon psychoanalysis in studying the psy- 
chology of thinking has been brought strongly to the fore by the appear- 
ance of a large volume assembling papers on the organization and pa- 

65 Much of the relevant case material is reviewed by Mahler-Schoenberger (1942) 
and by Pearson (1952) (1954). 
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thology of thought. 60 Especially important for the psychoanalytic influences 
are the sections devoted to symbolism (including experimentally pro- 
duced dreams), to motivation of thinking, fantasy thinking, and pathology 
of thinking. 

While, as on most topics, one can find a few pioneer investigations, 
the suggestions from psychoanalysis have not yet been fully incorporated 
into the experimental study of thinking. For example, Seeman (1951) 
lists 23 titles as relevant to the Freudian theory of daydreams, but only 
seven of these use any techniques of quantification at all. 

Learning at its best must include creativity, whether in the form of 
creative imagination embodied in the arts, or in inventiveness in science 
or practical affairs. 07 The hints from psychoanalysis are deserving of study. 

The study of character or personality syndromes may prove useful 
in determining why of two people, equally intelligent, one is creative, the 
other not. 08 We have to be careful not to give pat answers, based on a 
stereotyped conception of “typical” personalities. For example, the com- 
pulsive dispositions associated with the “anal character” are not creative 
in essence, and may, in fact, produce mere handwashing instead of pro- 
ductive effort. Yet there is a certain plodding quality 7 about scientific data- 
gathering which may demand some of this same compulsiveness. Scien- 
tific objectivity, a valuable trait, is at the same time a form of dissociation 
or isolation between wish and intellect, and is perhaps achieved at some 
cost to the individual. Some papers by Kubie have stressed the personal 
problems of the scientist. 

In the current emphasis on general-education programs in our col- 
leges and universities we hear much about the integrative role of the arts 
and music. Yet as we look at artists and musicians we see some truth in the 
popular conception that they are somehow different, and occasionally 
classify as deviates. Hence the question recurs of the relationship between 
art and neuroticism. One form of putting it is this: Would the artist be 
less creative if he resolved his conflicts through psychoanalysis? Is not 
his neurosis precious to him? Possibly if we were successful in our mental 
hygiene programs we would develop a class of very uninteresting normal 
and contented people; perhaps we are saved from this by the ineffective- 
ness of our measures. There are more constructive possibilities. For one 
thing, artists and writers who come to analysis are those who find that 

m Rapaport (1951). See also Rapaport (1950). 

67 For a pertinent discussion, with references, see Vinacke (1952), pages 238-281. 

** E.g., Stein and Meer ( 1954). 

69 Kubie (1953) (1954). 
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they are afraid to practice their arts. For them, psychotherapy may give 
back their creativity. Again, the end of therapy is not necessarily to get 
rid of conflicts, but to find a way of living with them. The issues are too 
important to be resolved casually on the basis of a few case studies. 
Academic psychology ought to take its share of responsibility in the study 
of creativity at all levels. 

Therapy as Learning 

Experimental psychology will always be cut off from some of the 
more important aspects of psychoanalysis if it insists upon methods lying 
outside the psychoanalytic process itself. Without free association, the 
handling of the transference, and the rest that goes on within psycho- 
analysis, some kinds of data are simply not available. From the point of 
view of learning theory it is important to know in detail how transference 
operates to straighten out the confusions between past and present, be- 
tween magical omnipotence and reality, so that the patient unlearns 
neurotic habits and acquires realistic and socially acceptable ones. To 
know these facts and relationships in detail is not easy, for the non-verbal 
communications between patient and analyst are important along with 
the words spoken and those subtle nuances of language that express 
doubt, belief, or encouragement. For a time it was thought that when 
Freud abandoned hypnosis he had also abandoned suggestion. As trans- 
ference came increasingly into prominence as part of therapy, Freud 
realized that the handling of transference involved suggestion, and he 
did not hesitate to use the word. 70 But suggestion is a subtle process, and it 
is a question whether the most conscientious analyst can give an accurate 
retrospective account of just exactly what forms of suggestions he used 
with his patient, and how the patient reacted. Therefore the need of some 
sort of objective record is required. We probably need, in addition to the 
recorded interview, an immediately recorded statement from the analyst, 
giving what he thought he was doing, and something of his own free 
associations that remained unexpressed. Where the psychologist comes 
in will be in the analysis and interpretation of the data the analyst pro- 
vides. 71 

Perhaps the main conclusion to be drawn from recognizing the un- 
explored areas in developmental aspects of learning, in psychodynamic 

70 Freud ( 1920b), page 18. 

71 For a discussion of these possibilities, see Bronner, A. F., and others ( 1949 ) . The 
most satisfactory studies of psychotherapy at present are the investigations of client- 
centered therapy by Rogers and his associates, e.g., Rogers and Dymond ( 1954). For a 
treatment of psychotherapy as learning, see Shoben (1949). 
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obstacles to learning, in the study of thought processes and creativity, and 
in the understanding of the learning that goes on within psychotherapy, 
is that these problems can be solved only through co-operative effort. The 
division of labor between the academic psychologist and the psycho- 
analyst is not an absolute one, with the analyst doing everything that 
can be done within psychoanalysis, and the psychologist only that which 
can be done in the laboratorv. If we are to achieve the ultimate under- 
standing of the learning process not only in all its richness, but also in 
the form of verified propositions, there must be collaboration as well as 
division of labor. 

EXPERIMENTS ON REPRESSION AS ILLUSTRATIVE 
OF THE LABORATORY USE OF PSYCHODYNAMIC 
CONCEPTS 

As indicated earlier (page 300), psychologists were quick to pick up 
the suggestion from Freud that affective factors might influence recall. 
We shall consider two problems: the relation between affect and ease of 
recall, and the relation between affect and recognition thresholds. Both 
imply that unpleasant affect may hinder recall or recognition through 
events identified with or analogous to Freudian repression. 

Repression and Recall 

A review of the pertinent literature by Zeller (1950a) lists 93 refer- 
ences, enough to suggest how inviting this field has proved to be to in- 
vestigators. We shall consider a few experimental reports typical of four 
approaches to the study of affective factors in recall. 

1. Recall of associates to affectively-toned sensory stimuli. The most 
satisfactory study, following upon several earlier ones, is that of Ratliff 
( 1938 ) . He had subjects rate the pleasantness and unpleasantness of odors, 
pitches, and colors. Then the subjects associated numbers with the af- 
fective items and later were tested for recall. The results with odors were 
against the hypothesis of the forgetting of the unpleasant, but with pitches 
and colors the recall of numbers associated with the pleasant items -was 
superior to the recall of numbers associated with unpleasant ones. Because 
the pleasantness and unpleasantness of sensory stimuli has little personal 
relevance, such experiments are no longer considered very pertinent as 
tests of Freudian theory. 

Despite the tenuousness of the relationship to the repression theory, 
it is of some interest to cast up the score on experiments dealing with the 
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recall of pleasant and unpleasant items (including words as well as sensory 
stimuli). Of 51 studies of this kind, 32 (63 per cent) favor more effective 
recall of the pleasant over the unpleasant, 14 (27 per cent) favor the 
reverse, and 5 (10 per cent) have neutral or ambiguous results. 72 

2. Recall of memorized words with personal affective connotations. 
In order to introduce some kind of ego threat into the laboratory-type 
experiment, Sharp ( 1938 ) went to the case records of neurotic subjects 
to find words that would be emotionally unacceptable to individual sub- 
jects, and words that for them would express gratifications. She then com- 
pared their retention of unacceptable and acceptable words, and found 
the unacceptable words less well recalled. Unfortunately, the only re- 
ported repetition of portions of her work did not yield confirmatory re- 
sults. 73 

In an ingeniously designed experiment, Keet (1948) selected critical 
words from a word-association test given to each subject. One of these 
words was imbedded in a list to be memorized. The recall of the list was 
then disrupted by retroactive inhibition through new learning inter- 
polated between memorization and recall. The disruption presumably 
included repression, for the critical word was more difficult to recall than 
the other words. Counseling procedures were used to alleviate the anxiety 
associated with the forgotten word, to encourage its recall much as for- 
gotten experiences are brought to awareness in psychoanalysis. Although 
Keet s results were striking, those who have tried to reproduce the memo- 
rization and retroactive inhibition portions of the experiment have thus 
far been unsuccessful. The general logic of his experiment has so much 
to commend it, however, that it is to be hoped that some reproducible 
variant of his experiment can be discovered or invented. 

Both the Sharp and the Keet experiments are more relevant to Freud- 
ian theory than the other experiments on affective materials because they 
selected items with a history of personal affective loading in the life of the 
individual outside the laboratory. Then they brought these items into the 
laboratory for quantitative study, 

3, Recall of affectively-toned life experiences. In a somewhat different 
effort to achieve naturalness in experiments, experimenters have aban- 
doned the memorizing of material under the artificial conditions of the 
laboratory and turned instead to study of the recall of actual experiences 
met outside the laboratory. 

Meltzer (1930) asked 77 college men and 55 college women to de- 

72 The tally was made by Zeller ( 1950a), page 42. 

73 Heathers and Sears {1943); see Sears (1943). 
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scribe their Christmas vacation experiences. He not only had them list their 
experiences, but also asked them to rate the experiences as pleasant, un- 
pleasant, or indifferent. Six weeks later he unexpectedly asked them to 
repeat the listing they had done on the day of their return. Not only did 
they list more pleasant than unpleasant memories immediately after re- 
turning from vacation, but after six weeks the predominance of pleasant 
over unpleasant memories increased. This finding is in accord with the 
interpretation of repression as an active process, continuing after the 
original event, though other interpretations are possible. 74 

A social-psychological bent was given to the naturalistic-setting re- 
call experiment by Edwards (1941) (1942) who studied the retention of 
political statements by those differing in their political views. He found 
that his subjects tended to remember better material compatible with 
their political attitudes than material incompatible with them. Hence he 
emphasized the “frame of reference” in which remembering occurs, con- 
flict with prevailing beliefs perhaps being more important than pleasant- 
ness and unpleasantness in controlling selective forgetting. 

4. Repression induced by experienced failure. In a careful theoretical 
treatment of repression phenomena according to possibilities within 
stimulus-response theory, Sears (1936a) suggested a possible paradigm 
for the initiation of repression, and then proceeded to test it experimentally 
(Sears, 1937). The essential feature is that an excitatory tendency moving 
along a stimulus-response chain towards a consummatory response will 
be blocked and fail of consummation if, prior to consummation, a compet- 
ing tendency is also evoked. This competing tendency violates conscience 
or is a threat to self-esteem, and can arouse guilt or self-criticism. Thus the 
arousal of an anxiety response concomitantly with an anticipatory goal 
response may prevent the occurrence of the goal response. This is, in 
short. Sear s paradigm for repression. His own experiment introduced a 
failure task (card sorting) between two sessions of nonsense syllable 
learning. The interference with the second list following failure was in- 
terpreted by Sears as an indication of repression. 75 

Discouragement induced by failure might lead to a lowering of effort 
and hence lead to poorer performance on a later task without any active 
repression. Hence, as Zeller (1950a) has pointed out, a complete test of 
repression ought to show recovery when the associated threat is alle- 
viated. Several experimenters have attempted to demonstrate the two 

74 See, for example, Waters and Leeper (1936). 

The family resemblance between these failure-induced changes in recall and the 
kinds of changes in the completed-incompleted task experiment may be noted. The 
Zeigamik-type experiments have already been reviewed in Chapter 8. 



316 THEORIES OF LEARNING 

phases: induced repression followed by the lifting of the repression. One 
of the first of these was done under hypnosis (Huston, Shakow, and 
Erickson, 1934). Suggestions under hypnosis led to the attachment of 
guilt to certain words. In the waking state, while amnesic to the hypnotic 
experience, the subjects showed through word associations an influence 
on associations to the guilty words that might be interpreted as repression. 
Following removal of the suggestions under rehypnosis, the repressive 
effects disappeared. Diven (1937) used a conditioning method, following 
certain word pairs with shock. In the recall of the paired-associates, sub- 
jects did much better with the non-shocked than with the shocked pairs, 
as though recall of the "dangerous” words had been inhibited. After a "de- 
conditioning” series, recall proved superior for the shocked words, hinting 
that an unconscious (repressed) integration may be a strong one. An 
attempted repetition of Diven’s work has not been successful (Cannicott 
and Umberger, 1950). There are apparently subtleties in this kind of ex- 
periment, as suggested also by Sharp's and Keet’s, that are not com- 
municated in the description of the experimental procedures. 

The initiation of repression through induced failure has been studied 
in an extension of Sears type of experiment by Zeller (1950b) (1951), 
Russell (1952), and Aborn (1953). These experiments, with the exception 
of Russell's, found not only a disruption of recall, but some subsequent 
recovery when threat was alleviated. 

Zeller used nonsense syllable learning and retention as the basic task, 
interpolating success and failure experiences in imitative tapping of Knox 
cubes. His experimental design included two tests of retention prior to 
the initiation of "repression,” then an immediate and a delayed (48 hour) 
test while "repression” was active, and finally an immediate and delayed 
test after removal of "repression.” His results for both recall and relearning 
corresponded to predictions from Freudian theory, though he pointed 
out that alternative interpretations cannot be excluded. Zeller used four 
conditions, so that he was able to compare the inhibiting results of failure 
specific to a second task with the inhibiting results of failure on a second 
task that had become associated with the performance of a previously 
learned task. Repression was not found when the failure was highly 
specific, but it was found when failure on the irrelevant task became as- 
sociated with performance on the earlier learned one. 

Abom (1953) added another dimension by comparing the results for 
intentional ("set”) learning with those for incidental learning. His sub- 
jects, reading the colored numbers on cards selected from the Ishahara 
and Dvorine tests, thought they were taking a color-blindness test. Those 
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“set” to learn were told to remember the numbers that appeared on the 
cards; the “incidental” learners received no such instructions. The “ego 
threat” was introduced for the experimental subjects by following the 
series of numbered cards with others on which no numbers appeared, thus 
implying that the subject was color weak because he could not distinguish 
a number against the background of colored dots. The threat was en- 
hanced by having him fail on some presumably related perceptual tasks, 



FIGURE 44. Repression and recovery in incidental learning associated with ego 
threat. The recall scores are of numbers learned incidentally while taking a color- 
blindness test. Mean, 20 subjects m each group. Rcplolted rrom the data of Abom 
(1953), page 228. 

stencil designs taken from the Arthur Performance Scale. As a test of 
memory, the subject wrote down the numbers remembered from the color 
test cards, then checked the numbers on a recognition test. The memory 
tests were unexpectedly repeated two days later. At this point the “threat” 
was removed by revealing the true nature of the experiment. The memory 
test was again given unexpectedly two days later and again after three 
days. The results for the incidental learners are given in Figure 44. For 
the control subjects there is progressive decline (“forgetting”) with the 
passage of time. For the experimental subjects, however, there are reduced 
scores after failure, followed by recovery after removal of threat. The 
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higher scores at the end, for the experimental subjects, correspond to 
Diven's finding, mentioned above. There were no differences of this kind 
for the subjects who had been given a set to learn; hence Aborn concludes 
that there are advantages in using incidental learning for this kind of 
study, and perhaps this fact helps to account for Russells ( 1952) negative 
results. 

Where do these studies lead us? The facts of amnesia, without addi- 
tional experiments, make abundantly clear that repression occurs, and that 
memories once lost can be recovered without relearning. 76 While there- 
fore experiments are not needed to establish the phenomena of repression, 
they may help us to delineate the precise circumstances under which 
repression occurs. Clarification of repression phenomena is today more 
important than establishing their genuineness. 

Repression and Recognition Thresholds 

While measures of recognition were used as tests of memory in some 
of the foregoing experiments, a rather different type of recognition experi- 
ment has recently come into prominence. In these experiments recognition 
refers to the perceptual recognition of well-learned material, as in the 
reading of a word under brief exposure. This differs from recognition as 
a test of memory for barely learned items. If the material to be perceived 
has connotations that are unpleasant, anxiety-producing, threatening, it 
may arouse perceptual defense , a warding off that results in higher thresh- 
olds. The tendency not to perceive is, then, a form of repression, a defense 
against the aroused meanings. 77 

The contemporary series of experiments had roots in earlier experi- 
ments but began most strikingly with a report by McGinnies (1949). 
He recorded simultaneously recognition thresholds and galvanic skin 
responses (GSR's) to neutral and emotional toned (taboo) words, pre- 
sented by a tachistoscope. During the prerecognition interval (i.e., while 
the exposure time was still so short that the subject could not name the 
word presented) the subject produced larger galvanometer deflections to 
the words he was going to have greater difficulty in recognizing than to 
the words which would prove eventually to have lower thresholds. The 
interpretation was that the GSR revealed anxiety connected with the taboo 
words, and that the high threshold for verbal response represented a per- 
ceptual defense against recognition of the forbidden words. 

76 Some good cases from combat neuroses are given by Grinker and Spiegel ( 1945a ) 
( 1945b ) , 

77 Klein (1954), pages 268-270, quite properly argues against assigning all cogni- 
tive controls in experiments of this kind to “defense.” 
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This study was followed almost immediately by criticisms and supple- 
mentary experiments, so that within the next three years a dozen reports 
appeared bearing upon it, and they have continued since. 78 

Some of the criticisms have to do with experimental logic, and some 
have to do with interpretations. Because all of the studies happen to treat 
of closely related substantive material, a review of a few of them will serve 
to indicate the complex causal factors that operate even within well- 
defined and restricted experimental contexts. 

The most telling criticism against McGinnics* results as a demonstra- 
tion of perceptual defense lies in the difference between his emotionally- 
toned taboo words and his neutral words in frequency of me in ordinary 
speech, as objectively defined by magazine word-counts. The criticism, 
first proposed by Howes and Solomon (1950), has led to a number of ex- 
periments demonstrating conclusively that the more common words have 
lower recognition thresholds than less common words, quite apart from 
their affective significance. 79 

Following up this criticism. Postman, Bronson, and Gropper (1952) 
equated critical and neutral words for frequency of ordinary use. They 
not only did not confirm McGinnies’ finding, but obtained the very op- 
posite result: lower thresholds for taboo words. While, therefore, the 
commonness of the words used must be controlled, does the total effect 
always disappear when that control is exercised? Not necessarily. Mc- 
Ginnies and Adornetto ( 1952 ) controlled word familiarity in experiments 
with schizophrenics, and found higher thresholds for taboo words, as in 
McGinnies’ original experiment. 

At least three additional determiners, besides frequency of word use, 
have been shown to affect recognition thresholds, 

1. If subjects are given incomplete information about what kinds of 
words to expect, the difference between taboo and neutral words is greater 
than if information is given in advance (Lacy, Lewinger, and Adamson, 
1953). Advance instructions obviously have nothing to do with experienced 
word frequency, though they may produce a readiness to perceive less 
familiar words. 

2. An increase in need-relevant motivation can influence recognition 
thresholds for related words, whether the words are common or uncom- 
mon ones (Wispe and Drambarean, 1953). They found the recognition 

78 For a review through 1952 see Postman ( 1953). 

79 For further work on the relationship between frequency and threshold, see Howes 
and Solomon (1951), Howes (1954), McGinnies, Comer, and Lacey (1952), Postman 
and Schneider (1951), Solomon and Howes (1951), Solomon ana Postman (1952), 
Wispe and Drambarean (1953). 
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thresholds for need-relevant words reduced after 10 and 24 hours without 
food and water, even though the thresholds both at high and low need 
states were correlated with the commonality of the words. 

3. The nature of the choices confronting the subject can influence 
the results, apart from the familiarity of the words. Thus a forced-choice 
method, in which the subject was required to choose a syllable from a list 
to correspond to the briefly exposed one, produced more difference be- 
tween thresholds for shocked and non-shocked syllables than a free-choice 
arrangement (Lysak, 1954). 

These supplementary findings do not diminish the need to control 
word frequency, but they do suggest that the phenomena studied ex- 
perimentally are functions of many other circumstances. 

Another criticism of the McGinnies type of experiment is that de- 
liberate suppression may take place, producing reduced response but for 
reasons differing dynamically from Freudian repression . 80 There may be a 
social hesitation to say taboo words, even though the words are recognized. 

Another line of investigation is related to McGinnies’ finding of GSR 
responses differentiating between neutral and taboo words at exposure 
times below threshold levels. The finding suggests renewed attack upon 
subthreshold perception, or subception , as it has come to be called. The 
issues are complex, but it usually turns out that there are sufficient supra- 
threshold features to account for most of the phenomena. 81 It is difficult 
even to know whether we are dealing with perception or a retrospective 
interpretation based on previously perceived fragmentary information 
(Lawrence and Coles, 1954), or whether we are dealing with perception 
or verbal habits (Neisser, 1954)* 

In the attempt to remove some of the artificiality of the taboo-word 
experiments, Eriksen (1951) proposed to make an independent study of 
personality needs and then predict what kinds of perceptual defense would 
occur. 82 Using a word-association test he scored his subjects (male 
patients in a veterans hospital) on needs related to aggression, succorance, 
and homosexuality. He then presented pictures tachistoscopically, with 
scenes related to these needs. He predicted that recognition thresholds 
would be raised for pictures suggesting disapproved needs. While the 

80 This criticism, and closely related ones, have been made by Howes and Solomon 
(1950), Lacy, Lewinger, and Adamson (1953), Lysak (1954), Postman, Bronson, 
andGropper (1952), Whittaker, Gilchrist, and Fischer (1952). 

81 See, tor example, Bricker and Chapanis ( 1953 ), Lazarus and McCleary ( 1951 ). 

82 This meets in part Klein's (1951) contention that we need to know more about 
the individual personalities of the subjects used in these experiments. In his experi- 
ments “high-interference” subjects sometimes react opposite to “low-interference” ones 
(Klein, 1954). 
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correlations obtained were not high, enough of them were significant to 
lend plausibility to the hypothesis that unacceptable needs tend to be 
defended against in perception. 8 ** 

The phenomena of repression have not yet been very much illumi- 
nated by experiments on perceptual defense any more than they have by 
experiments on the ease of recall. Patience is needed, however, and it is 
too much to expect that as these topics are newly brought into the 
laboratory we can achieve at once both richness and precision. 

ESTIMATE OF FREUD S CONTRIBUTION TO LEARNING 

Freud's Position on the Typical Problems 

The foregoing account shows many points at which Freudian theory 
has impinged upon the psychology of learning. A recapitulation is in 
order to summarize his views as they relate to the several problems with 
which learning theories deal. 

1. Capacity. As a developmental psychology, Freud’s theory implies 
that the very young infant is most impressionable, 84 so that primary re- 
pressions occur in the first years of childhood, and character syndromes 
find their origins in the conflicts over food, toilet training, sex, aggression. 
The resolution of these conflicts takes place according to fundamental 
themes or styles of response, and the rest of life is spent in playing out 
these early themes with the assimilation, of course, of new content. Fun- 
damental changes in the structure of the personality do not take place 
through ordinary education; such changes require the special kind of re- 
education provided by depth therapy. Alongside these modes of conduct 
determined by instinctual conflicts there are the conflict-free ego processes, 
which are the kinds dealt with in ordinary give-and-take with the real 
environment. At some stages of life the conflictual personal problems loom 
unusually large, and make difficult the adjustment to external reality im- 
plied in school learning. The easiest time for conflict-free learning should 
be during the "latency period,” and after adolescent conflicts are resolved, 
but individual differences are to be expected throughout because some 
children do not outgrow their earlier fixations, and some conflicts remain 
active and disruptive at any age. 

2. Practice . The repetition-compulsion is not a principle of learning- 
by-doing, for ordinarily the compulsive behavior is peculiarly resistant to 
change according to its consequences. The principle of learning through 

» 3 See also Enksen (1952); Lazarus, Eriksen, and Fonda (1951). 

84 This may possibly be related to “imprinting,'* as emphasized by the ethologists. 
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practice is better illustrated by “working through/' that aspect of therapy 
in which the patient faces the same conflicts over and over again in the 
process of re-education. Learning takes place in “working through" be- 
cause the conflicts are faced from new angles, and the cues to faulty con- 
duct become detected early enough so that the behavior can be deflected. 
Repetition is needed for learning, but for repetition to be effective it must 
be repetition with a difference. 

3. Motivation . Freud's is chiefly a psychology of motivation, and he 
detected motivational control in kinds of behavior that others had thought 
of as trivial or accidental, such as minor forgettings and slips of speech. 
The first serious effort to incorporate psychoanalytic theory into general 
psychology recognized the Freudian wish as providing a theory of motiva- 
tion. 85 The motivational concepts that have impinged most directly upon 
contemporary learning theory are anxiety (as a learned drive), and the 
consequences of various ego threats , as in the studies of regression, ag- 
gression, repression, and the defense mechanisms generally. The emphasis 
upon sex as a master drive has been less influential in learning theories 
than in other fields of personality study, although many of the ego- 
threatening situations used have included sexual conflicts. 

4. Understanding, Despite Freud's preoccupation with the irrational 
in human behavior, his theory lays great stress upon the possibilities of 
cognitive control. We can even gain insight into our unconscious processes, 
if we work at it properly — no mean achievement for the intellect. The 
developing “ego psychology," playing up the “conflict-free ego sphere," 
allows even more room for rationality in the control of conduct. 

The aim of psychoanalytic therapy is to get rid of self-deception and 
other blocks to rationality. To the extent that the methods are successful, 
they should provide some principles useful for learning and teaching. 

5. Transfer, We must not permit verbal equivalences to lead us to 
assume identity where identity does not exist. “Transference” has a special 
meaning in psychoanalysis. It refers to the special role that the therapist 
plays for the patient as the therapist stands from time to time for important 
people in the patient’s life, perhaps the mother, or father, or older brother. 
The patient reacts to the therapist with the emotions appropriate to these 
other people. Thus psychoanalytic transference does share with transfer 
of training the fact of generalization of responses learned in one situation 
to novel but related stimuli. 

Problems of equivalence loom large in psychoanalysis. By way of 
symbols, something commonly stands for something else and provokes the 
83 Holt (1915). 
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responses (especially affective responses) appropriate to that something 
else. We have seen this process at work in displaced aggression. Other 
processes of symbolization and condensation are relevant to the manner 
in which earlier and later learning come into psychological relationship. 

6. Forgetting. Freud long championed the view that registration of 
early experiences persists throughout life, that forgetting is therefore 
chiefly the result of repression. Forgetting is mostly a kind of amnesia, 
and under appropriate circumstances (as in dreams) the persistence of 
these memories may be detected, while under other circumstances (as 
in psychoanalysis ) the repression may be lifted and the memories restored 
to waking consciousness. Although the most important repressions take 
place first in early childhood, the repressive process continues throughout 
life, maintaining the original repressions, and adding new items to the 
unconscious store. 

Significance of Pstjchodtjnamics for Learning Theory 

One influence of psychoanalytic thinking has been to broaden the 
topical content studied within the field of learning. Psychoanalytical think- 
ing has helped to erase the boundaries between the neurotic and the 
normal, so that what was once relegated to ‘'abnormal psychology” now 
becomes part of general psychology. Because there is a recognized con- 
tinuum between the neurotic and the normal, the learning of symptoms 
becomes continuous with the learning of mannerisms and the acquiring of 
attitudes. Symbolism in night dreams makes us look for symbolism in day- 
dreams, and in other products of creative imagination and thought. What 
we learn from unusual perceptual distortions, hallucinations, and amnesias 
influences how we think about ordinary perception in relation to the needs 
of the perceiver, and ordinary forgetting as related to motivation. Hence 
the range of topics that interest students of learning has been extended 
to include perceptual distortion, repression, and symbolism, all implying 
personal and idiosyncratic modes of expressing the results of learning. 

In one form or another the conception of unconscious determination 
has made important changes in thinking about human motivation. Psy- 
chologists seeking to maintain the behavioristic orientation of the 1920’s 
were somewhat resistant to consciousness and even more so to an un- 
conscious consciousness. But even Watson was influenced strongly by 
Freudian doctrine and accepted "the unverbalized” as a substitute for the 
unconscious. 86 Thus the subjective-objective controversy did not turn out 
to be a decisive one, so far as incorporating motives of which the learner 

88 Watson (1924). 
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was unaware (or of which he could not speak). Freud can be credited with 
being the first to propose that repression leads to the inability to verbalize: 

Now, too, we are in a position to state precisely what it is that repression 
denies to the rejected idea in the transference neuroses — namely, the translation 
of the idea into words which are to remain attached to the object. 87 

Any appraisal of psychoanalytic influences would be most incomplete 
if it did not acknowledge the emphasis upon unconscious processes and 
their derivatives. This is the most obvious way in which to acknowledge 
the dynamic contribution of psychoanalysis. 

Finally, the genetic or developmental aspects of psychoanalysis have 
brought to the fore the need for an adequate eg o psychology , 88 If we are 
to understand the learner as he sets his goals and works realistically toward 
them, or as he is torn by conflicts that prevent his using his abilities, or 
as he bums himself out in the quest for futile objectives, we need a theory 
of personality organization incorporated within our general theory of 
learning. 

The goads from psychoanalysis to broaden the content studied within 
learning, to recognize the range of motivational determiners, including 
unconscious ones, and to build toward an ego psychology are acceptable 
ones, and the acceptance of these challenges has already proved reward- 
ing. A word of warning is in order. Psychoanalytic thinking is very com- 
plex and loosely formulated, so that it is not easy to find out what is es- 
sential, what is dispensable, and what internal contradictions there are in 
it. We need a careful, critical systematization of what appears to be best 
substantiated through clinical experience and through other sources of 
scientific evidence, so that irrelevancies and contradictions are either dis- 
carded or stated in form for decisive testing. We need to locate the most 
pertinent issues, and then to do our best to get adequate evidence to re- 
solve these issues. There is much in Freud that nearly everyone will 
reject, such as the inheritance of acquired characteristics, and some meta- 
phorical statements about localization of function in the brain. The prob- 
lem is not to guess what Freud “really” meant and then to defend or refute 
him. The problem is to find out what is true, regardless of who said it. 
Freud uncovered some interesting hypotheses. We need to state them as 
carefully as we can, and then put them to the test. 

87 Freud (1915b), 133-134. Miller (1948b) has quoted these words and developed 
a related theory of repression in relation to displacement. 

8S $ee, for example, Symonds (1951). Misbach (1948), after a careful review of 
Freud’s statements, concludes that his contributions to learning theory lie largely 
in the emphasis upon ego ; r '/ .i:: ss the background for learning. Misbach believes 
the pleasure-pain ( law of effect ) principles in Freud to be secondary. 
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The Need for Collaborative Research 

If psychologists are to avoid working with pallid replicas of the events 
dealt with in psychoanalysis, some of their work will have to be carried 
on in collaboration with those experienced in depth therapy. The psy- 
chologist will have to gain such sophistication as he can through personal 
experience with the phenomena, while he also learns the necessary meth- 
ods and controls required to do research with them. Some of the problems 
will elude him unless he works closely (at least some of the time) with 
those who are dealing intimately with the more extreme manifestations of 
the phenomena — with emotional panic instead of the irritation of mild elec- 
tric shock, with traumatic amnesia instead of the repression of mildly taboo 
words, with transference of hallucinatory intensity instead of a comfort- 
able rapport, with the unusual states created under drugs and hypnosis in 
addition to those created by a few hours of hunger. It may be that some 
research problems in this field can be solved only with types of facilities 
that are but now coming into being . 80 
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raphy adequate to the needs of the student of psvchology, is; 

Hall, C. S., and Lindzey, G. ( 1954) Psychoanalytic theory and its applications 
in the social sciences. In G. Lindzev, eel. Handbook of social psychology , 
143-180. 
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Jones, E. (1953) The life and work of Sigmund Freud . Vol. 1. The formative 
years and the great discoveries , 1856-1900. 

Jones, E. (1955) The life and work of Sigmund Freud . Vol. 2. Years of maturity , 
1901-1919. 

The author is a disciple of Freud who has enough confidence in the greatness 
of his master so that he does not find it necessary to soft-pedal Freud's human 
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Chapter 10 

FUNCTIONALISM 


T HE eight learning theories we have examined in the preceding chap- 
ters include one stimulus-response theory that antedated behavior- 
ism (Thorndike), three varieties of stimulus-response behaviorism (Guth- 
rie, Skinner, Hull), one "purposive” or "expectancy” behaviorism (Tol- 
man), two variants of gestalt theory (classical gestalt and Lewin), and a 
psychodynamic theory (Freud). For the final major position to review, we 
return to the one out of which behaviorism grew but which flourished 
alongside it and continues to be well represented in contemporary psy- 
chology. It is the loosely formulated position known as functionalism, ap- 
propriately treated last because of its eclectic character and because its 
supporters have been free to learn from all of the other theories of learn- 
ing- 

While the name "functionalism” arose in connection with the brand 
of psychology developed by Angell and Carr at the University of Chicago, 
the name is equally applicable to the dynamic psychology that Wood- 
worth taught at Columbia. Boring considers Woodworth "perhaps the 
best representative of the broad functionalism that is characteristic of 
American psychology .” 1 Woodworth thinks of functionalism as a middle- 
of-the-road position, not really a "school” of psychology . 2 

GENERAL CHARACTERISTICS OF FUNCTIONALISM 

Functionalism in one form or another has a long history. In Europe 
such men as Galton, Binet, and Claparede could be classified as function- 
alists. In America, James, Hall, Ladd, Baldwin, Cattell, and Seashore were 
all functionalists in outlook. While we turn to the University of Chicago 
first because it was there that functionalism became defined, functionalism 
early in the century was "in the air” at Clark under Hall, at Columbia 

i Boring (1950), page 722. 

2 Woodworth ( 1948 ), page 255. 

328 
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under Cattell and Woodworth, and at Yale under Ladd, Scripture, and 
Judd. 

The Influence of John Dewey 

John Dewey (1858-1952) is the “founder” of official functionalism. 
His famous paper on the reflex-arc concept in psychology ( 1896), through 
its clear opposition to prevailing elementarism, marked the beginning of 
the new protest. One might have expected him to direct his attack against 
sensationism, for sensation was the fundamental element in the Wundt- 
Titchener psychology. Instead the attack was on the reflex-arc concept, 
which, as Heidbreder has pointed out, was the very concept that had 
been introduced to move beyond a static description of elements of con- 
sciousness. 3 There was, in fact, a controversy going on between Baldwin 
and Titchener over reaction time (involving the reflex arc) and the first 
experimental paper using Dewey’s formulations was a study of reaction 
time by Angell and Moore (1896), mediating the Baldwin-Titchener 
controversy. 4 

Dewey’s main argument was that activity should not be thought of as 
starting with a stimulus, going through a central process, then emerging 
in a response. Instead, the activity is a complete cycle — a “reflex circuit” 
— in which the response may seek or “constitute” the stimulus. The rela- 
tion of the response to the posterior stimulus may be as important as its 
relation to the anterior one. 5 

Functionalism was christened by Titchener (1898) when he attacked 
Dewey’s position, contrasting functionalism with structuralism. The chal- 
lenge was accepted by Dewey s younger colleague, James Howland Angell 
(1869-1949), who became the leader of the new movement. We shall 
return to him presently. 

Dewey turned from academic psychology to education before he left 
Chicago, and he went to Columbia in 1904 never again to concern him- 
self directly with the problems of laboratory psychology. But he had 
already left his mark on the functional psychology that continued after he 
moved on. The animal laboratories and the studies of child development 
express the spirit of his Darwinism, with its emphasis upon the evolution 
of adaptive mechanisms.® Not much of his later writing appears in psy- 

3 Heidbreder ( 1933), page 212. 

* Baldwin and Shaw ( 1895), Baldwin ( 1896); Titchener ( 1895a) ( 1895b) ( 1896). 

5 The same line of argument was picked up by a later Chicago psychologist, Thur- 
stone (1923). There is a family resemblance between Dewey’s position and Skinner’s 
operant behavior, in which responses are coordinated with the stimuli to which they 
lead. 

6 Dewey ( 1920). 



330 THEORIES OF LEARNING 

chology textbooks; later references in them are chiefly to his How we 
think (1910), which gave a practical and teachable account of some 
steps in problem-solving modeled after the steps in scientific investiga- 
tions. 

It would be a mistake, however, to think of Dewey’s influence upon 
the psychology of learning as limited to the work of those who developed 
functional psychology within the laboratories. At least three other lines 
of influence can be discerned, each bearing his imprint. 

The most marked of these, for its relevance to the psychology of learn- 
ing, is his influence upon ways of thinking about learning as the product 
of schoolroom practices. This influence began with his founding of the 
experimental elementary school at the University of Chicago, 7 and the 
subsequent development known in its heyday as progressive education. 
His emphasis upon interest and effort, upon the child’s motivation to 
solve his own problems, represented a dynamic innovation for which the 
laboratory psychologists were not yet ready. Consequently Dewey’s chal- 
lenge was taken up by educational philosophers rather than by psycholo- 
gists. 8 Now it so happens that cousins may each resemble a common 
grandparent without resembling each other. Although Dewey’s influence 
is clear in Carr’s Psychology (1925), and in Kilpatrick’s Foundations of 
method (1925), the detailed learning theories of these two books, the first 
a psychologist’s and the second an educator’s, have little in common. Pro- 
gressive education, at its best, was an embodiment of the ideal of growth 
toward independence and self-control through interaction with an environ- 
ment suited to the child’s developmental level. The emphasis was upon 
intelligent problem-solving ( not upon learning through play), in which 
each child solves his problems by selecting appropriate materials and 
methods and by learning to adapt these materials and methods to his 
ends. His interest sustains his effort as he experiments with his solutions 
by testing them in action. The kinds of problems solved are social as well 
as individual, for education is envisaged as a preparation for life in a 
democracy through democratic living here and now. How different this 
all sounds from the rules for memorizing a list of nonsense syllables in the 
minimum number of trials! 

The second line of development, outside academic psychology, was 
upon sociological social psychology, by way of George Herbert Mead 
(1863-1931), who came with Dewey from Michigan to Chicago. Mead, 
whose views were very similar to those of Dewey, became the guiding 

7 For the early history of this school, with the point of view that infused it, see 
Mayhew and Edwards ( 1936). 

s E.g., Childs ( 1931 ), Bode { 1940). 
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spirit for developments in sociology paralleling those of Dewey in psy- 
chology and education. Mead is important for contemporary learning 
theory because he sponsored a form of self- or ego-psychologv a whole 
generation before experimental psychologists were ready to take it up. 0 
Meads interpretation of the self as arising out of social interaction is 
close to the spirit of Dewey the social philosopher and the defender of 
democratic values, who saw every act as the expression of a unified self 
seeking to resolve its conflicts through intelligent action in a world of 
objects and other selves. 

Finally, we have Dewey the philosopher, carrying on the pragmatic 
tradition of Peirce and James in a form known as instrumentalism or 
experimentalising Even late in life he continued to write upon logic ( 1938) 
and upon epistemology (with A. F. Bentley, 1949), always deferring to 
the scientific method as the one that has helped man to acquire his firm- 
est knowledge. He insisted that ordinary mortals, without the scientist's 
equipment, could learn lessons from the method of science helpful in 
the conduct of their daily lives. 10 

A case could be made for bringing these diverging trends into a new 
synthesis that would have significance for the psychology of learning. 
Indeed, in ill-defined ways something of the sort may already be happen- 
ing, as we meet new emphases within learning theory upon ego aspects 
of learning, upon searching behavior and intelligent problem-solving, 
upon learning in social contexts. These new emphases supplement a prior 
preoccupation with quantification and a somewhat constricted methodol- 
ogy. A new synthesis, strictly in the functionalist tradition, may legiti- 
mately acknowledge Dewey in its ancestry. History has a way of moving 
in cycles. Shortly before Dewey s death his interpretation of perception 
as a transaction between the organism and the environment (foreshad- 
owed in the original paper on the reflex arc ) began to find its way back 
into the literature of psychology. 11 Possibly the rediscover) 7 of other use- 
ful ideas may cause a re-evaluation of Dewey’s place in the history of 
psychology. 

Angell and His Successors 

We return now to the main line of development of functional psy- 
chology, which went forward under Angell and his successors. Harvey 

0 Mead’s influence is discernible clearly in Cottrell and Gallagher (1941), and in 
Newcomb (1950). One learning theorist makes direct acknowledgment to him: 
Muenzinger (1942). 

10 For Dewey s philosophy, see Ratner ( 1939 ) . 

lx E.g., Kelley (1947). Meson and Cantril (1954). 
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Carr succeeded him at Chicago; Edward S. Robinson, a younger man, 
taught at Chicago before he went to Yale to carry on a similar program 
until accidental death cut short his career. Next to carry the mantle was 
John A. McGeoch, whose career, too, was ended by death at an early age. 
The students of Carr, Robinson, and McGeoch continue to be active, such 
men as Bills, Heron, Irion, McKinney, Melton, Waters, Underwood. Their 
students, in turn, keep the flavor of Chicago functionalism alive, although 
no one of those mentioned is now at Chicago, and several of them never 
were. 

In an early clear statement of the functionalist position Angell (1907) 
made three points: 

1. Functionalism is interested in the how and why of mental opera- 
tions as well as in the what (i.e., descriptive content). It is a cause-and- 
effect psychology, or in modern terms, an input-output psychology, con- 
cerned with mental operations in their context. 

2. Functionalism is essentially a psychology of the adjustment of 
the organism to its environment. Consciousness evolved to serve some 
biological purpose, to help the organism solve its problems especially when 
conflicts arise and habits no longer suffice. Once this position was taken, 
the way was open to welcome applied psychology: educational psychol- 
ogy, industrial psychology, mental hygiene. 

3. Functionalism is interested in mind-in-body, and so studies the 
physiological substratum of mental events. The implied dualism is a purely 
practical one ( as in current psychosomatic medicine ) and does not imply 
a special position on the mind-body problem. The only position that must 
be rejected, if one is to stress the adaptive role of consciousness, is that 
known as epiphenomenalism, the view that consciousness is a useless by- 
product of neural activity. 

Now that the quarrels within psychology have shifted somewhat, and 
the definition of psychology’s province is no longer a matter to fight about, 
the word “function” has tended to return more nearly to its mathematical 
use, as in the expression, y = f (x), (“y is a function of %”). This causes no 
great shift for the functionalist, for all he asked was to be free to study 
many sorts of contingencies, to find what depended on what, whether the 
dependency was upon age, species membership, prior practice, the in- 
gestion of a drug, or accepting the experimenters instructions. Neverthe- 
less, the shift is away from the earlier emphasis upon adaptiveness, upon 
purposes served. While the functionalist is free to work on applied prob- 
lems, the freedom is one of choice, and a functionalist may work with 
“pure” functions if he chooses. There is a kind of neutrality about the 
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mathematical function that makes it particularly appealing to the scien- 
tist; like the correlation coefficient, it expresses a relationship without 
requiring anyone to claim that it explains the relationship, If it does lead 
to a kind of explanation, there need be nothing teleological about it; there 
is no implication, with Voltaire, that God must have given man a nose 
in order to have a support for his spectacles. If some noses support spec- 
tacles more readily than others, or if some kinds of spectacles fit more 
kinds of noses, that is all that the functionalist is interested in finding 
out. 


The Background of Woodworth's Functionalism 

Like Dewey, Woodworth was influenced by William James and G. 
Stanley Hall, although he studied only with James. His functionalism 
derived in part from them, and from his early association with Thorndike, 
Cattell, Ladd, and Sherrington. Both Angell and Woodworth became 
interested in imageless drought, and though they disagreed about it, the 
problems of imageless thought raised the question of meaning in rela- 
tion to consciousness, a good functionalist topic. 1 - 

Wood worths functionalism gave considerably more place to motiva- 
tion than the earlier Chicago functionalism. The role of motivation was 
prominent in Woodworths Dynamic psychology (1918), and his sugges- 
tions were heavily leaned upon when motivation became incorporated 
into Chicago functionalism by Carr (1925). 

Apart, then, from their sharing ancestry in men like James and Hall, 
the developments at Chicago and at Columbia were not intimately related 
but were mutually congenial, and with the passing of time the differences 
between them have gradually faded so that we can treat the general out- 
look as one, particularly as that outlook is reflected in contemporary psy- 
chology. 

The Flavor of Functionalism 

Because contemporary functionalism is so loosely articulated, a point 
of view without acknowledged leadership and with few loyal and self- 
conscious adherents, it defies precise exposition. Hence we must be con- 
tent to come to an appreciation of its flavor rather than to achieve final 
judgment about its principles. 

1. The functionalist is tolerant hut critical The functionalist is free 
from self-imposed constraints that have shackled many other svstematists. 

i- The functionalist Rahn (1914) pointed out that it was impossible to introspect a 
meaningless element. This was very damaging to the structuralist position of Titehener. 
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He uses the words from diverse vocabularies, borrowing words freely from 
other traditions. He is not forbidden the use of older words because today 
they sound subjective (e.g., "idea,” "meaning,” "purpose”), or because 
they have occasionally been given systematic connotations that he does 
not accept ("sensation,” "image,” "ego”). He does not believe that any- 
thing is gained by new terms, unless advance in knowledge justifies the 
further precision that new words can bring. For example, he believes it 
premature to call all thinking "implicit verbalization,” for the objective 
terminology is not yet justified by what we know about thinking. Hence 
he holds to the older word. His definition of the field of psychology is also 
a tolerant one, and he is ready to accept information obtained by intro- 
spection, by objective observation, from case studies, from mental tests. 
He is tolerant as to method, and he is also tolerant as to content. The dis- 
tinction between pure science and applied science seems to him trivial, 
so long as either is good science. 

It is a mistake to confuse this broad tolerance with looseness, as 
though functionalism is merely an uncritical eclecticism. On the contrary, 
the functionalist is commonly a very astute critic. Because he has his eyes 
open for variables that may be ignored by more dogmatic systematists, he 
is not easily trapped into accepting "pat” systematic solutions for intricate 
problems. This critical attitude is expressed in a preference for relativism 
over absolutism , 13 Recognizing the many determiners of psychological 
activities, Carr deplored what he called the quest for "constants,” by 
which he meant one-to-one correspondences, or psychological laws stated 
without reference to all the influential variables. 

2 . The functionalist prefers continuities over discontinuities or typol- 
ogies. The mathematical statement of a functional relationship usually 
implies a gradual transition between the values of the dependent variable 
correlated with the independent variable as it increases or decreases. 
While extremes may differ so markedly as to appear qualitatively unlike, 
the functionalist looks for connecting in-between cases. This continuity 
of function was made the basis for a dimensional principle by McGeoch, 
and the suggestion has been elaborated by Melton . 14 

The two chief classes of variation recognized by Melton are those 
dependent upon experimental arrangements, which may be called situ- 
ational dimensions , and those that depend upon the psychological func- 
tions involved, which may be called process dimensions . 

Carr (1933). 

14 McGeoch ( 1936), Melton ( 1941a) ( 1950). The dimensional principle was applied 
earlier to a modem version of the attributes of sensation by Boring (1933a). 
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The situational dimensions call attention to the differences between 
what the learner is called upon to do from one experiment to another. A 
very different problem is presented to the learner by a path through a 
maze, the rotating target of a pursuitmeter, a list of syllables on a mem- 
ory drum, a reflex to be attached by conditioning to a new stimulus, and 
a mathematical problem to be solved by reasoning. 

The process dimensions are correlated in a coarse way with the 
situational dimensions, but are distinguishable. They include such con- 
siderations as the amount of discovery required before the correct re- 
sponse is made (a distinction between rote learning and problem-solving ), 
the involvement of motor or ideational responses (as between motor skills 
and verbal memory), and the relevancy of motivating conditions ( inci- 
dental learning as against intentional learning ). 

Experiments have to be classified according to more than one dimen- 
sion. Thus rote learning and reasoning experiments are alike on the di- 
mensions verbal-nonverbal, but unlike with respect to the amount of 
discovery involved in the adequate response. It is important to recognize 
that the classification is not according to fixed types, but always according 
to scales that have intermediate values. Learning is not blind on the one 
hand and insightful on the other; there are degrees of understanding in- 
volved from a minimum at one extreme to a maximum at the other, with 
most cases falling between these extremes. 

3. The functionalist is an experimentalist. In its modern form, func- 
tionalism is dedicated to the experimental method. The issues upon which 
the functionalist is so tolerant become a part of his science only when 
they are translated into experimentable form. He is free of constraint In 
his choice of dimensions, and may choose to set up a dimension of items 
graded for similarity or a dimension of tasks graded according to degree- 
of-understanding. He prefers to drive general issues back to specifics 
before he is led into controversy over them, for he believes that many 
linguistic difficulties fade when reference is made to the specific findings 
of experiments. 

American psychology so generally accepts the three points which 
have been used to characterize functionalism that functionalism has be- 
come an almost unrecognized ground for mutual understanding. There 
is, however, a lingering bias detectable among leading functionalists that 
tempers the judgment that functionalism is neutral in respect to the major 
quarrels among learning theorists. This is the bias toward association 
theory, and the environmentalism that so often accompanies association 
psychologies. The affiliation with historical association theory has been 



336 THEORIES OF LEARNING 

recognized by leading functionalists. 15 The preference for environmental- 
ism shows up largely in the treatment of perception, where the issue as- 
sociated with the differences between Helmholtz and Hering has been 
kept alive in differences between gestalt psychologists and associationists. 
The functionalist prefers to interpret perceptual discriminations as 
learned. 16 Although these biases are not absolute ones, when there is room 
for doubt (as in some aspects of size constancy) one can predict that the 
functionalist will be on the side of empiricism and against nativism. 

In summary, we find the functionalist tolerant but critical, favoring 
continuities over discontinuities, seeking to translate his problems into 
experimentable form. Within the free, eclectic atmosphere, he neverthe- 
less has a preference for interpretations coherent with historical associa- 
tion theory as against holistic or nativistic interpretations. 

FUNCTIONAL ASPECTS OF THE LEARNING 
PROCESS 

Because the functionalist does not have a highly articulated learning 
theory, we can best understand his approach by following a functional- 
ist’s analysis of the problems facing an experimental psychology of learn- 
ing. In what follows we shall accept Melton’s ( 1950) summary of learning 
as a representative functionalist statement, using his analysis of learning 
and his outline of topics. 



FIGURE 45. The readjustive process. After Dashiell (1949), page 26. 

Melton begins with Dashiell’s diagram of the readjustive process 
(Figure 45). The diagram shows how a problem arises when ongoing ac- 
tivity (1) is blocked by an obstacle (2). Eventually the organism through 

15 Carr (1931), Robinson (1932a). Melton (1950) describes his position as an 
“associationistic functionalism.” 

16 E.g., Carr ( 1935). The favorite experiment quoted by functionalists has long been 

that of visual experiences with reversing lenses, Stratton ( 1896) . ° 
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varied behavior (3) solves the problem (4) and proceeds on his way (5). 
If the process is repeated, the adequate response (4) recurs in less time, 
with less excess and irrelevant activity. 

Thus the learning process is primarily a matter of the discovery of the 
adequate response to a problem situation and the fixation of the satisfying 
situation-response relationship. 1 7 

From this analysis, Melton is led to state several major experimental 
problems of learning. Expressed as topics, these are: 

1. Motivation 

2. Initial discovery of the adequate response 

3. Fixation and elimination 

4. Factors determining the rate of learning 

5. Transfer of training and retention 

We can follow this outline in characterizing the functionalist ap- 
proach to these topics. 

1. Motivation 

Woodworth was the first functionalist to give a treatment of motiva- 
tion in modern form, in a small book entitled Dynamic psychology ( 1918). 

It was this book that gave currency to the term “drive” as alter- 
native to “instinct.” Woodworth set the stage for the prominence that 
drives and motives were presently to have within psychology. 

The cycle between aroused activity and quiescence is discussed ac- 
cording to preparatory and consummatory reactions, Woodworth crediting 
Sherrington with the distinction. Consummatory reactions are those 
which satisfy basic drives or needs, activities such as eating or escaping 
from danger. They are directly of value to the organism. Their objective 
mark is that they terminate a series of acts, leading either to rest or to the 
turning by the organism to something else. A preparatory reaction is only 
indirectly or mediately of value to the organism, its value resting on the 
fact that it leads to or makes possible a consummatory reaction. The ob- 
jective mark of the preparatory reaction is said to be that it occurs as a 
preliminary stage leading to the consummatory reaction. 

Preparatory reactions are of two kinds. One kind represents alert 
attention, “a condition of readiness for a yet undetermined stimulus that 
may arouse further response.” Another is the kind of preparatory adjust- 
ment evoked only “when the mechanism for a consummatory reaction 
has been aroused and is in activity.” The latter sort of reaction is goal- 

17 Melton ( 1950), page 670. 

18 Sherrington ( 1906). 
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directed, and evokes seeking behavior, as when a hunting dog, having lost 
a trail, explores about in order to get back on it. 

A new conception introduced is that of ongoing activity as itself a 
drive — activity once under way leading to its own completion. Even if 
the original drive under which the activity started has ceased to operate, 
the activity, if incomplete, may continue. The distinction between drive 
and mechanism breaks down when it is asserted that the mechanism (i.e., 
the reaction under way) may become its own drive. Allport later (1937) 
proposed that motives may become functionally autonomous , that is, com- 
pletely divorced from their primal sources in basic needs of the organism. 
Activity leading to further activity, and functional autonomy, are very 
similar conceptions. Both are related to the problem of secondary rein- 
forcement. If stimuli associated with reinforcement themselves come to 
have reinforcing power, it may be that a system of derived rewards may 
be built up which will continue to have motivating power without re- 
course to the satisfaction of any basic need. The increasing importance 
attaching to secondary reinforcement in theories like Hulls and Skinner s 
shows Woodworth's perspicacity in being the first to recognize that the 
distinction between drive and mechanism becomes less sharp as activity 
gets under way. 

Another facet to Woodworths motivational theory is the abiding 
interest that he has shown in the concept of “set,” dating from his early 
experiments on imageless thought. It was the new emphasis brought about 
by the imageless thought controversy that in the first decade of this 
century provided functionalism's supplementation to traditional associa- 
tion psychology. Although Angell and Woodworth were on opposite sides 
of the fence on some issues concerned with the introspective contents of 
thought, they were in agreement on the dynamic consequences of pre- 
disposing sets and intentions. 19 

The sets and intentions emphasized by the students of imageless 
thought, and the preparatory reactions oriented toward consummatory re- 
sponses are concepts with a similar dynamic loading. They are both di- 
rectional, they are both inferred from the manner in which behavior se- 
quences are organized. In a later development of his conception of set, 
Woodworth introduces what he calls the situation-set and the goal-set 
(1937). The situation-set refers to adjustments to environmental objects; 
the goal-set refers to the inner “steer” which gives unity to a series of varied 
but goal-directed activities. 

Woodworth ( 1906); Angell ( 1906). 
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The important suggestion in the concept of situation-set is that the 
environment must be brought into the psychological account in a manner 
not suggested by the usual stimulus-response psychology. Guthrie, for 
example, makes much of the fact that he is interested in movements, not 
in outcomes. Woodworth takes the opposite position. Movements are, to 
Woodworth, less important than the information about the environment 
to which these movements lead. The rat in the maze learns spatial re- 
lationships, not running habits; in the problem-box the cat discovers the 
door-opening character of the button, not a sequence of manipulatory 
movements. As Woodworth puts it: ‘‘The character of goodness attaches 
for the animal not to his movements in manipulating the button, but to 
the button as an object/’ 

It is to Woodworth s credit that his long history of espousing stimulus- 
response concepts did not so fix his thinking that he was unable to go 
outside this systematic framework as he felt it necessary. The conception 
of situation-set presents a kind of bridge to ways of looking at psychology 
that have been carried out more fully by others, notably by Tolman and 
Brunswik. Through the years, Woodworths use of “set” and related con- 
cepts has prevented his falling into a static system. 

Carrs interpretation of motivation in his textbook (1925), was in- 
fluenced by Woodworth s views, but restated the preparatorv-consumma- 
tory adjustment in new terms. 

Carr accepted the principle that all behavior is initiated by stimuli, 
so that the point of view is similar to other stimulus-response psycholo- 
gies, with the difference that responses may be ideational as well as motor. 
He is concerned with activities, not movements. An adaptive act is de- 
scribed as involving a motivating stimulus, a sensory situation, and a 
response that alters that situation in a way that satisfies the motivating 
conditions. The external object by which the organism satisfies its motives 
is called the incentive, or the immediate objective or goal of the response. 
(The more remote goal, such as maintaining the fife of the organism by 
preventing starvation, does not enter directly into the control of the 
act.) More specifically, a motive is described as a relatively persistent 
stimulus that dominates the behavior of an individual until he reacts 
in such a manner that he is no longer affected by it. This makes of a motive 
what Smith and Guthrie call a maintaining stimulus. The adaptive act is 
organized to obtain certain sensory consequences (i.e,, the termination 
of the motivating stimuli) and the act can be interpreted as completed 
only when the consequences are attained. The interpretation of the end 
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phase as a sensory change became Carr’s way of talking about the em- 
pirical law of effect. 20 

Melton, accepting Carr’s definition of motivation as a stimulating 
condition, points out that motivating conditions have three functions: (1) 
they energize the organism, making it active, (2) they direct the variable 
and persistent activity of the organism, and (3) they emphasize or select 
the activities that are repeated (fixated) and those that are not repeated 
(eliminated). Stated in this way, the functionalist’s interpretation of mo- 
tivation is consistent with Hull’s, but it is coherent also with the position 
of Tolman and Lewin. 

2 . Initial Discovery of the Adequate Response 

Melton points out that initial discovery may be the main problem in 
some kinds of learning (e.g., in the problem-box), while it plays little part 
in others (e.g., in serial rote memorization). The controversy between 
trial and error and insight in the interpretation of learning hinges in part 
on the nature of this initial discovery of the correct solution. The function- 
alist does not have to take sides; he waits for definitive experiments. 

Melton notes that two other forms of initial discovery have played 
less systematic roles in learning theories, yet they should not be neglected. 
One of these is guidance . The discovery of the correct response can be 
facilitated by the teacher or trainer through appropriate manipulation of 
the environment or the learner. Hence guided learning is intermediate 
between rote learning (with no discovery) and problem-solving (with un- 
guided discovery). An extensive research program on guided learning was 
carried out by Carr and his students and later summarized by him. 21 Such 
a program illustrates the freedom with which the functionalist selects his 
experimental variables. 

The other form of initial discovery mentioned by Melton is imitation . 
He believes it can be subsumed under guidance, however, for it helps the 
learner to discover the solution by observing the solution by another 
organism. 

Melton, in common with other functionalists, is willing to accept 
the initial discovery of solutions with insight, but, in common with other 
associationists, prefers interpretations emphasizing past experience: 

Insight is, however, not a term to conjure with even though it is a useful 
descriptive concept when employed within the framework of the major dimen- 
sions of analysis of the problem situation and the process of discovery of the 
correct response. In any event, the range, specific nature, and plasticity of the 

2° Carr (1938). 

Carr (1930). 
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trial behavior of the organism seem to be most profitably related to the motivat- 
ing condition of the organism and to the associative spread from previous learn- 
ing>‘ 


3 . Fixation of Adequate Responses and Elimination 
of Inadequate Responses 

Melton accepts the empirical law of effect as stated by McGeoch: 
“Other things being equal, acts leading to consequences which satisfy a 
motivating condition are selected and strengthened, while those leading 
to consequences which do not satisfy a motivating condition are elimi- 
nated/' When it comes to explaining effect, Melton is cautious in choos- 
ing between a contiguity theon and a reinforcement theory: 

Two alternatives seem to be available to the student at this time: (a) The 
issue of contiguity vs. effect will be decided at some distant time in terms of the 
over-all fruitfulness of one or the other postulate, as judged by the application 
of Hulls theoietical mtthodolog) ; (b) It may be that mere contiguity and rein- 
forcement are extremes of some as yet undefined dimension of the neurophysi- 
ology of the organism, such that associations may be formed through mere 
contiguity or through leinforcement. Meanwhile, it is necessary to recognize the 
experimental findings which make the complete generality of one or the other un- 
tenable, for these are actually the fruits of the research on this problem during 
the forty years. 21 

This quotation well illustrates the functionalists unwillingness to 
prejudge the outcome of a controversy whose resolution ultimately must be 
determined by experiment. 

4 . Factors Determining the Rate of Learning 

Whatever may be the underlying principles of learning when learn- 
ing is reduced to the barest essential relationships between stimulus and 
response, it is not difficult to arrange situations under which learning 
occurs. By Ion ^-established convention we plot performance scores against 
learning trials and come out with a “curve of learning/’ The search for a 
"typical” or “time" form of the learning curve has not proven profitable, 
but learning curves have helped in the search for the parameters of learn- 
ing. Through a great many experiments we have learned much about the 
relationship between rate of learning and the amount of material to be 
learned, about the effects of length and distribution of practice periods, 
about how the characteristics of the learner affect the rate of progress. 
These problems have all been inviting ones to the functionalist who sets 
as his task the exploring of the many dimensions influencing learning. 

-- Melton ( 1950), page 676. 

McGeoch ( 1942), page 574. 

- 4 Melton ( 1950), page 677. 
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Some guiding principles are needed to establish order in an em- 
pirical program of such vast scope as accounting for everything that affects 
all kinds of learning by all kinds of learners under all kinds of circum- 
stances. Some of these guiding principles were inherited from the pre- 
experimental association psychologists. A useful list had been provided by 
Thomas Brown as early as 1820. He had classified the laws of association 
(or laws of suggestion, as he called them) into primary or qualitative 
laws (the laws of similarity, contrast, and contiguity), and secondary or 
quantitative laws (frequency, vividness, emotional congruity, mental set, 
etc.). The primary laws represent the essential general conditions for as- 
sociative formation or associative revival, whereas the secondary laws 
determine which of many possible associates is formed or recalled. 

After Ebbinghaus (1885) had shown that associations could be 
studied experimentally, writers such as Muller (1911-1917) in Europe 
and Carr (1931) and Robinson (1932a) in America began to rewrite as- 
sociative laws in quantitative form. Carr and Robinson were quite explicit 
that all associative laws could be made quantitative, and they gave lists 
of laws that they thought expressed our twentieth-century knowledge 
about associative learning. 

Here is Robinson s list of laws: 25 

Law of contiguity 

Law of assimilation 

Law of frequency 

Law of intensity 

Law of duration 

Law of context 

Law of acquaintance 

Law of composition 

Law of individual differences 

While in topic some of Robinsons laws sound qualitative, he attempts 
to show how all qualitative distinctions can be subjected to a dimensional 
analysis. He himself raised the question whether or not in isolating these 
factors as important he had really stated “laws.” He summarized his dis- 
cussion of “laws” as follows: 

It is my firm conviction that the facility of associative fixation is a function 
of all of the factors enumerated. Probably several specific relationships are 
involved for each named factor and almost certainly there are other factors 
that have not been included in this list. But, if the assumption that these factors 
are important determiners of association be correct, then there are “laws” of these 
factors whether our knowledge of them is definite or not. 26 

25 The laws are stated in Robinson { 1932a), pages 62-122. 

2 * Robinson (1932a), page 124. 
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It turns out that Robinson s position is much like Skinners, In that 
he points out the relevant variables and makes a claim for “lawfulness” 
rather than for “laws.” Because of these limitations in the list of “laws,” 
the Carr-Robinson laws did not catch on, even among their students, as 
a means of ordering the factors determining the rate of learning, although 
there are occasional mentions of isolated laws from the list. The issues 
expressed in the laws were somewhat too general. The empirical role of 
contiguity,* 7 for example, breaks down into the stud)’ of several varieties 
of time relationships between events, with their corresponding gradients. 
Thus the separation of a conditioned and unconditioned stimulus in clas- 
sical conditioning yields one kind of empirical law of contiguity, while 
the delay of reward at the end of a maze yields another kind. We tend 
to name such relationships more concretely (e.g., gradient of reinforce- 
ment, spread of effect) rather than to refer them to a more general law 
of contiguity. So, too, the law of frequency, as a law of relative frequency, 
covers too many topics, including the form of the learning curve, the form 
of the work curve, the effects of distributed and massed practice, the 
consequences of degrees of overlearning. Frequency is only a general 
topic, within which a number of very different laws are discoverable. 

Contemporary functionalist writers, instead of setting up basic laws 
of learning, tend to use classifieatory schemes derived directly from ex- 
perimental arrangements and results. Underwood, for example, uses the 
following scheme to outline the factors determining the rate of learning: 

a. Massed versus distributed practice 

b . Type of material 

( 1 ) Intra-list similarity 

(2) Meaningfulness 

(3) Aff activity 

c. Knowledge of performance 

d . Miscellaneous (e.g., whole versus part learning; active recitation; sense 

modality; amount of material) 28 

While Underwoods treatment is part of a chapter in an introductory 
textbook on experimental psychology and is not intended to be exhaustive, 
his classification of variables bears a close resemblance to that used in 
other functionalist treatises on learning. 29 Such a classifieatory scheme is 
not in itself a theory of learning, yet it suggests as well as the Carr-Robin- 
son laws the kinds of variables with which functionalist systems deal. 

- 7 To be distinguished from the fundamental axiom of contiguity as in Guthrie's 
theory. The empirical problem of contiguity is a dimensional one, concerned with 
degrees of contiguity. 

28 Underwood ( 1949), pages 398-419. 

29 Major summaries of the literature handled functionally can be found in McGeoch 
and Irion (1952), Osgood (1953), Woodworth and Schlosberg (1954). 
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5, Transfer of Training and Retention 

Once something has been learned it can be used, provided it has not 
been forgotten and provided new situations recur in which the previously 
learned behavior is called forth. The study of the relative permanence 
of learning, when tested in situations essentially duplicating those of the 
original learning, is the study of retention , while the study of the effects of 
old learning in new situations is often discussed as transfer of training. 
Because recurring situations always recur with differences, there is ob- 
viously an intimacy between retention and transfer. Because transfer ef- 
fects may be positive, negative, or indeterminate, one view of forgetting 
is that it is but an illustration of negative transfer. 

Melton follows McGeoch (1932a) in accepting two major laws of 
forgetting. The first is the law of context (one from Robinson’s list), which 
asserts that the degree of retention, as measured by performance, is a 
function of the similarity between the original learning situation and the 
retention situation. The second is the law of proactive and retroactive in- 
hibition , which asserts that retention is a function of activities occurring 
prior to and subsequent to the original learning. Proactive and retroactive 
inhibition have been major topics of research for several decades, ever 
since the problem was first opened up by Muller and Pilzecker (1900). 
A number of good summaries of the literature are available. 30 

The paradigm for retroactive inhibition is A — B — A, where the 
learning of B is interpolated between the learning and retention of A, and 
interferes with the retention of A. The paradigm for proactive inhibition 
is B — A — A, where the learning of B prior to the learning of A interferes 
with the later retention of A. Both retroactive and proactive interference 
with learning are readily demonstrable, and the empirical relationships 
have led to a number of hypotheses. Studying the development of one of 
these hypotheses will help us to understand not only retroactive inhibition 
but the manner in which functionalists construct their theories. 

One set of problems arises over the similarity between the interpolated 
material and the material originally learned. Robinson (1927), arguing 
from some earlier results of his own and of Skaggs (1925), formulated a 
hypothesis later christened by McGeoch as the Skaggs-Robinson hypothe- 
sis. With the usual functionalist preference for stating dimensions, Robin- 
son proposed relating the amount of retroactive inhibition to the dimen- 

30 For the earlier literature, see Britt (1935) and Swenson (1941). A general sum- 
mary can be found in McGeoch and Irion ( 1952 ) , 404-447. 
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sion of degree of similarity between the original and the interpolated 
material or activity. 

With the similarity dimension in mind, Robinson argued that the 
interpolation of identical material (Material B the same as Material A) 
would simply provide additional practice on Material A, hence lead to 
increased retention on the test trials during which retroactive inhibition 
is usually shown. Because retroactive inhibition with dissimilar materials 
was already an established fact, the natural conjecture on the assumption 
of continuous variation is that starting with identity the amount of retro- 
active inhibition would increase gradually as dissimilarity was increased. 
Now, asks Robinson, what is likely to happen at the other end of the 



DEGREE OF SIMILARITY BETWEEN INTERPOLATED 
ACTIVITY AND ORIGINAL MEMORIZATION 

FIGURE 46. Similarity as a factor in retroactive inhibition. The curve is intended 
to show that retroactive inhibition bears a quantitative relationship to the degree of 
similarity between the interpolated activity and the material originally memorized. 
With maximum similarity, the interpolated activity provides positive transfer, hence 
increases the efficiency of recall. Maximum interference with recall is predicted to 
fall at some intermediate value of similarity. After Robinson (1927), page 299. 

scale, as the original material (Material A) and the interpolated material 
( Material B ) become extremely unlike? Presumably retroactive inhibition 
represents some sort of interference based on similarity between the 
original and interpolated activity. If there is very little similarity there 
should be very little retroactive inhibition. Putting all these considerations 
together, it is reasonable to expect a maximum of retroactive inhibition at 
some intermediate point of similarity between Materials A and B. Robin- 
son formulated the whole generalization in words as follows: “As similarity 
between interpolation and original memorization is reduced from near 
identity, retention falls away to a minimum and then rises again, but 
with decreasing similarity it never reaches the level obtaining with maxi- 
mum similarity/' He expressed this graphically by the figure reproduced 
as Figure 46. 




346 


THEORIES OF LEARNING 

His own experimental test of the generalization was very simple. By 
the memory span method he studied the recall of the first four of a series 
of eight consonants as this recall was interfered with by the last four of 
the consonants. That is, the first four were considered to be Material A, 
the last four Material B, and the similarity and dissimilarity of Materials 
A and B were controlled. Similarity was defined as partial identity. Maxi- 
mum similarity meant that the second four consonants were like the 
first four; maximum dissimilarity meant that all the last four differed 
from the first four. 

The results confirmed the hypothesis only partially. Starting with 
near-identity, retroactive inhibition increased as the interpolated material 
became increasingly dissimilar. This was part of the conjecture. But the 
decrease in the amount of retroaction (increase in recall) with maximum 
unlikeness was not found. In fact, with the materials totally dissimilar, 
retroactive inhibition was at a maximum. 

A number of later experimenters attacked the problem after Robin- 
son formulated it in this way. 31 Results gave partial confirmation, but 
there was some incompleteness and minor contradictions appeared in the 
data. The results led Osgood (1949) to propose a more complex formula- 
tion of the relationships involved. Osgoods solution of the similarity 
problem is diagrammed in the three-dimensional surface shown as Figure 
47. 

What Osgood’s surface states is that the amount of transfer in positive 
or negative directions is a function of shifts in similarity between learn- 
ing and retention of both the stimulus conditions and the response re- 
quired. Shifts in stimulus similarity are from front to rear as noted on the 
right-hand margin, moving from identical stimuli (S x ) through similar 
stimuli ( S s ) to neutral stimuli ( S N ) . Shifts in response similarity are repre- 
sented from left to right, as noted along the back margin, with identical 
responses (R x ) being at the left, and moving progressively through similar 
responses (R s ), neutral responses (R N ), partially opposite responses (R 0 ) 
to directly antagonistic responses (R A ). 

The best way to practice reading the diagram is to read its edges first. 
The rear edge says that stimuli bearing no resemblance to those used in 
original learning lead to new responses without any transfer effect, positive 
or negative, regardless of the degree of resemblance between the required 
responses and responses that have been used in earlier experiments. The 
front edge says that with identical stimuli there will be maximum positive 

31 Mention should be made also of a more rational analysis of the form of the 
Skagg-Robinson function by Boring (1941). 
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transfer with identical responses (for this is merely overlearning), whereas 
with directly antagonistic responses there will be maximum interference, 
for the earlier responses will have to be completely unlearned or over- 
come. The curve moves with two inflections between these extremes, but 
these inflections are not actually determinate from present data. The left 
edge says that, with identical responses, shifts in stimulus similarity from 



FIGURE 47. Osgood’s transfer and retroaction surface (after Osgood, 1949). 
Vertical dimension, amount of transfer (-f ) or interference ( — ), with neutral zone 
represented by a plane (0). Left to right, amount of shift in response similarity 
between original task and new task, from identity (Ri) to antagonism (Ra). Front to 
rear, amount of shift in stimulus similarity between original and new task, from 
identify ( Si ) to neutrality ( Sn ) . 


identity to neutrality will result in decreasing transfer, but no interference 
in new learning. The right edge says that, for antagonistic responses, shifts 
in stimulus similarity from identity to neutrality will produce decreasing 
interference, but no positive transfer. The diagram is a surface, and yields 
a curve wherever it is cut by a vertical plane. 

In Robinson s kind of experiment the response called for varies with 
the stimulus, so if the stimulus changes the response also changes. (This 
is typically true in serial rote learning, because the response item is also 
a stimulus item for what follows. ) Hence the Skaggs-Robinson diagram 
(Figure 46) can be approximated by making a diagonal slice across 
Osgoods surface (Figure 47), from the near left-hand corner to the rear 
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point marked Rn. The near left-hand corner represents the Skaggs- 
Robinson condition of maximum task similarity ( identity of both stimuli 
and responses between learning and testing), while the rear point repre- 
sents neutrality of both stimuli and response relative to the original task. 
Such a diagonal curve will be high at the left, dip down into a zone of 
interference, and return to the neutral point at the rear, thus having the 
essential characteristics of the Skaggs-Robinson curve, though with some- 
what different inflections. 

We may trace the evolution of the models to account for the role of 
similarity. First we have the experiments that demonstrate the existence 
of retroactive inhibition, and then some that demonstrate the possible role 
of similarity as a factor. Next we have Robinson’s somewhat crude dimen- 
sional hypothesis, leading to a series of experiments. Meanwhile a series 
of related experiments on transfer effects are being conducted, as though 
the two phenomena have little in common. Presently Osgood offers his new 
synthesis, covering the data that have accumulated since Robinson s 
hypothesis was announced, incorporating at once the retroactive inhibition 
and the transfer results. The Skaggs-Robinson hypothesis is seen to be a 
limited special case of a more complex set of relationships. This succession, 
with an interplay between data and theory, and a higher synthesis in 
dimensional terms, is exactly what might be expected in a maturing 
functional analysis. The next step, as Osgood points out, ought to be a 
quantitative (mathematically precise) model of his surface. While the 
surface now accounts in a general way for all of the available data, it is 
largely schematic, and nothing is known about the precise form it should 
take, or if, indeed, there is one form or a family of forms. 

The discussion of Meltons topics has led us somewhat afield from the 
content or conclusions of his own treatment of learning, but the outline 
served its purpose in providing the topics for an overview of the functional- 
ist analysis of the problems of learning. 

Does this emerge as a theory of learning? Melton emphasizes in his 
summary that a dimensional analysis of learning, although scarcely qualify- 
ing as a theory, is not anti-theoretical. Theoretical systems, he states, help 
to clarify problems and they lead to the discovery of new empirical re- 
lationships. But these relationships have to be firmly established if we 
are to have a science of learning. We need to standardize and calibrate 
valid and reliable methods of investigation in order to have better data 
for the theorist to manipulate, 
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EFFECTS OF THE DISTRIBUTION OF PRACTICE AS 
ILLUSTRATIVE OF CURRENT FUNCTIONALISM 

One of the best ways to study a scientific theory is to see it in action, 
that is, to note how those who hold to the theory select their topics and 
design their experiments and how they analyze and interpret their data. 
We have followed this procedure in reviewing the theories thus far dis- 
cussed. We have had some introduction to the functionalist at work on 
the problem of retroactive inhibition. Now we are ready to examine 
further how functionalism works through giving more detailed considera- 
tion to some selected experimental problems. 

How the General Orientation Contributes to the 

Planning of Experiments 

When we attempt to determine how the theory directs the experi- 
ments that the functionalist chooses to perform we note two characteristics 
of his approach. 

We discover, first, that the general theory provides only an orienta- 
tion to experimentation, without providing a framework of crucial laws 
and postulates. The dimensional approach is a very broad frame of ref- 
erence within which to work. It would be entirely unmanageable and 
cumbersome were it not for the guiding threads provided through years 
of experimentation by familiar methods upon familiar topics. Consider 
what would be involved in studying all the dimensions of massed and 
distributed practice. There are all the variables common to learning ex- 
periments (subjects, nature of materials, interrelationships within the 
materials, the kind of learning task), and there are the special variables 
specific to the problem of distribution (location of the interval between 
items, between trials, between banks of trials, between learning and re- 
tention, within relearning; length of the interval, whether uniform or 
variable; nature of the activities filling the interval). The combinations 
and permutations are staggering. 

The second thing that we find out is that the experimenter typically 
narrows his task (within the broad dimensional orientation). He does 
this in two chief ways. First, he falls back on tasks that have familiarly 
been used in these studies, chiefly rote memorization and motor learning. 
He further restricts himself by addressing his attention to some of the 
“special theories” that are current or that he may himself invent. Hence 
the functionalist depends not only upon his general orientation but upon 
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a body of experimental lore and a number of special theories that reduce 
his task to manageable proportions. 

The Testing of Special Theories Related to 

Distributed Practice 

In order to illustrate the foregoing interpretation of how the func- 
tionalist goes about his experiments, we shall examine a number of ex- 
periments studying distributed practice. One series of twelve experimental 
reports, by Underwood and his associates, well illustrates the nature of a 
systematic functionalist investigation. 32 While these investigators appear 
to be making a strictly dimensional analysis, the papers are in fact guided 
largely by several special theories, and the dimensions chosen are by no 
means all that might have been selected. We turn now to some of these 
special theories — some tested by Underwood, and others with which he 
was not concerned. 

1. The theory of differential forgetting ( McGeoch ). 33 McGeoch's 
theory states that in the course of memorization or practice a subject learns 
erroneous (conflicting) responses that retard progress. Because these 
errors are less well learned than the correct responses, lapse of time will 
permit the errors to be forgotten more rapidly than the correct responses. 
Hence distributed practice will be advantageous over massed practice. 
In serial memorization some of these errors are remote associations (place- 
skipping tendencies). Because only the correct responses are repeatedly 
reinforced, the correct responses are probably better fixed than the errors. 
Experimenters who have looked for the forgetting of errors, as implied 
in McGeoch s theory, have been unable to find them. In one of the most 
careful direct studies of the problem, Wilson ( 1949 ) found no fewer errors 
in distributed than in massed practice, even though distributed practice 
resulted in more rapid learning, and he failed to find predicted effects in 
retention tests. Later studies, e.g., Underwood and Goad (1951) have 
also failed to substantiate McGeoch’s theory. 

2. The generalization-differentiation theory (Gibson). 34 Gibson as- 
sumes that in the learning of a nonsense syllable the subjeot has to dif- 
ferentiate the individual syllables from nonsense-syllables-in-general. If 
the subject is making some correct responses and some erroneous ones, we 
may say, according to this interpretation, that the correct responses have 

32 These are Underwood and Goad (1951), Underwood (1951a) (1951b), Under- 
wood and Viiema (1951), Oseas and Underwood (1952), Underwood (1952a) 
( 1952b) ( 1953a) ( 1953b) (1953c) ( 1953d) ( 1954a). 

33 McGeoch (1942), page 142f.; McGeoch and Irion (1952), pages 183-188. 

33 Eleanor J. Gibson ( 1940 ) ( 1942 ) . 
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been better differentiated from syllables-in-general, while the erroneous 
responses still suffer from too great generalization, that is, lack of discrim- 
ination. What will be the effect of a rest interval? Because differentiation 
(by analogy with what goes on in conditioning experiments) involves an 
inhibition of alternative responses, rest should bring “spontaneous re- 
covery” of generalization. Hence the differences in the strengths of tend- 
ency to give correct and incorrect responses should be reduced after rest. 
Thus distributed practice should prove to be disadvantageous as compared 
with massed — an implication contrary to McGeoch s hypothesis. 

Experimental results are more favorable to Gibson's theory than to 
McGeoch’s. With distributed practice, erroneous responses commonly 
do show an increase over those in massed practice, especially during the 
early trials. 35 

How difficult it is to make predictions that are genuinely unam- 
biguous! We assumed, in stating Gibsons theory, that distributed practice 
should be disadvantageous. This was not really a sound assumption. It has 
been found that with distributed practice there are more errors, as her 
theory predicts, but learning may still be more rapid under conditions of 
distributed practice. It is possible that more errors are associated with a 
heightened set to respond that is in turn associated with more rapid learn- 
ing. Part of the mystery of the finding disappears if we remember that 
in addition to false responses there are other kinds of inadequate re- 
sponses in a serial learning experiment, such as a failure to respond, not 
counted as an erroneous response. 

Another aspect of Gibsons generalization hypothesis has to do with 
the intra-list similarity of items. If items in a list are very much like each 
other it will be harder to differentiate them. If the process of learning is 
one of differentiation, it should be harder to learn a list of similar (homo- 
geneous) items than a list of heterogeneous ones. This implication of the 
theory can easily be confirmed experimentally. 36 A second implication is 
that forgetting will be more rapid with greater intra-list similarity because 
of the spontaneous recovery of interfering generalizations. The interfer- 
ing generalizations will, of course, be more common with more similar 
items. This implication does not receive experimental support. Because of 
the support of Gibson's theory in the learning and relearning data, but 
not in the retention data, Underwood (1954b) was led to propose some 
supplements to her theory. 

36 Underwood and Goad ( 1951), Underwood ( 1951b) ( 1952b). But not all experi- 
ments agree with the theory, e.g,, Underwood ( 1951a ) . 

86 There is a family resemblance here between Gibson's theory and Kohlers. See 
page 246. 
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The results of four of Underwood’s studies 37 (with both serial- 
syllable and paired-syllable lists) agreed substantially on the following 
three points: (1) The higher the intra-list similarity the more difficult the 
learning; (2) retention over a' 24-hour interval is independent of the level 
of intra-list similarity; and (3) the higher the intra-list similarity the more 
difficult the relearning. 

Underwood is called upon to account for the unexpected failure of 
intra-list similarity to have an effect upon retention, although it clearly 
affects both learning and relearning. He proposes that there may be two 
kinds of interference that do not reveal themselves in retention because 
they balance each other: 

(1) If intra-list similarity is high, there is interference within the 
list, as predicted from Gibson’s generalization hypothesis. Underwood’s 
subjects learned and relearned three successive lists. His evidence shows 
a distinct trend for errors at recall to come from within the list when 
similarity of items within the list is high, and from outside the list when 
similarity of items within the list is low. Because some of these outside items 
came from previously learned lists, it is implied that there is inter-list inter- 
ference when intra-list similarity is low. 

(2) If intra-list similarity is high, the list being learned is more 
clearly differentiated from previously learned lists than if similarity is 
low. This accounts for less inter-list interference with high intra-list simi- 
larity. 

Combining these two propositions we note that there is an inverse re- 
lationship between intra-list and inter-list interference with variations 
in intra-list similarity. Thus when intra-list similarity is high, the high 
intra-list interference is offset by low inter-list interference. 38 Provided 
these opposing tendencies balance each other, we have an explanation for 
the finding that recall is independent of the degree of intra-list similarity. 

Evidence supporting these conjectures is given in Figure 48. The curves 
are based on the learning of successive lists of paired associates. Although 
early in practice the high-similarity lists are much harder to learn than 
the low-similarity lists, the curves converge with practice. High-similarity 
lists become relatively easier both because there is some learning-how-to- 
learn and because there is little inter-list interference. This accords with 
the hypothesis. Low-similarity lists become harder to learn, not because 
learning-how-to-leam is absent, but because there is increasing inter- 

sr The four studies are Underwood (1952b) (1953a) (1953b) (1953c). 

88 That better bounded material interferes less than loosely bounded material was 
shown earlier by Werner ( 1947 ) . 
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ference (proactive inhibition) from the learning of prior undifferentiated 
lists. (This, too, agrees with the hypothesis. ) 

The interpretation is that, as one of these interferences goes up the 
other goes down. Hence they may balance so that retention does not show 
any differential effects for high or low intra-list similarity. If these are 
balancing functions, it ought to be possible to give one a boost over the 
other. In one of Underwood's experiments in this series, 39 it was found 
that with extreme similarity learning remained difficult, but retention was 
improved. This result, too, conforms to his hypothesis. 



FIGURE 48. Relearning of paired-syllable lists as a function of stage of prac- 
tice and intra-list similarity. Underwood (1954b), page 163. 

3. The theory of recovery from work decrement. A very plausible 
theory of the advantages of distributed practice has been proposed from 
time to time by those who noted that massed practice had some of the 
characteristics of a "work curve” as well as a “learning curve.” This is 
particularly evident in motor learning. In Figure 49, for example, we have 
the performance curves of two groups on a pursuit rotor, one group of 
subjects practicing for one minute and resting for three minutes (dis- 
tributed practice), the other practicing for three minutes and resting 
for one minute (massed practice). Scores were recorded at the end of 
each minute of the 3-minute group, and these appear in the curve as a 
series of “work curves.” The gain from practice shows as recovery over 
the 1-minute rests, but there is little advantage in the three times as much 

39 Underwood ( 1953c ) . 
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practice that this group has had. The kind of theory that these curves 
suggests, supplemented by “warm-up” phenomena, has been given a 
more formal treatment by Ammons (1947), and works reasonably well 
for pursuit learning. 40 

The related theory for rote memorization was elaborated by Hull and 
others (1940), largely around the experimental work of Ward and Hov- 
land. 41 Here it was assumed that interferences accumulating within learn- 



FIGURE 49. Distributed practice in pursuit learning. Group A practiced 1 minute 
and rested 3 minutes, while Group B practiced 3 minutes and rested 1 minute. 
Scores at each minute within the 3-minute trials show progressive work decrement. 
Dor6 and Hilgard (1937). 

ing are of the nature of inhibitions in conditioning, and thus recover with 
time. A number of conjectures were borne out in the data: that there 
should be reminiscence after a rest following learning, that the disad- 
vantage for items near the middle of a list should be overcome somewhat 
with rest, that paired associate learning should show less benefit from rest 
than serial learning. 

The main objection to the theory was raised by one of its proponents 
Hovland (1940) — when he pointed out that the theory should predict 
that massed practice ought to lead to better retention than distributed 

40 Further work along closely related lines has been done by a number of others e/r. 
Kimble and Horenstein ( 1948), and Irion ( 1949). ' ' 

"Ward (1937), Hovland (1938) (1939). 
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practice. The argument is straightforward: In order to meet a criterion of 
mastery under inhibitory conditions, more has to be acquired than under 
less inhibitory conditions. Hence the amount retained, when the inhibi- 
tion vanishes, should be greater for the group that learned under a handi- 
cap, that is, the massed-practice group. 

Two remarks need to be made about Hovlands objection to the 
^theory. In the first place, the advantages of distributed practice for reten- 
tion are by no means universal. In this respect some of the data gathered 
by others support Hovlands theory better than his own. 42 In motor learn- 
ing it is not uncommon for the groups with scores depressed by massed 
practice to gain more following both short and overnight rests than groups 
with distributed practice. 43 In the second place, no single-factor theory 
can be expected to account for the complex data within all phases of the 
distributed practice problem. A theory cannot be completely validated by 
data that agree with it, nor can it be completely refuted by data that dis- 
agree with it. This follows if we admit that there may be competing 
processes operating, and in any experiment one of these processes may gain 
ascendancy over the other. We saw this possibility in Underwood’s sup- 
plementation of Gibson’s theory. 

Theories of the work-interference type, whether expressed in terms 
of conditioned inhibition, or reactive inhibition, or in other language, are 
bound to have some place in the explanation of distributed practice. They 
account well for the similar effects found for such unlike tasks as nonsense 
syllable memorizing, pursuit learning, and concept formation. Direct 
tests have heretofore been made largely with the motor tasks, but Hovland 
and Kurtz (1951) have come up with an experiment in which the results 
of work are tested directly within a memorization experiment. If a subject 
does difficult mental work of another kind just before memorizing a list, 
more reminiscence for the list follows than if he either did not do the 
mental work or was permitted to rest between the mental work task and 
the memorization. 

4. The consolidation theory . One of the earliest “special” theories 
bearing on distributed practice is that of Muller and Pilzecker (1900), 
known originally as the perseveration theory. The theory assumes that 
the benefits of practice continue for a while after practice ceases, and the 
results of learning are therefore not at their maximum until an appreciable 
time following the end of practice. Were the perseveration theory to hold, 

42 E.g., Underwood ( 1952b ) . 

43 Hilgard and Smith (1942), Cook and Hilgard (1949), 
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such phenomena as reminiscence and the advantages of distributed prac- 
tice would be expected. Related forms of the theory attribute the consoli- 
dation effect to growth or maturation stimulated by practice. 44 

One hypothesis derived from the theory of stimulation-induced matu- 
ration Is that, within the wide limits of amount of practice, learning should 
proceed at a rate that is a function of the learner and of the task to be 
learned rather than a function of the number of trials or of the actual 
amount of time spent in practice. Too little practice (or, at the extreme, 
none at all) will fail to stimulate the growth underlying the skill, while 
too much practice will prove to be disruptive and inefficient. 

The hypothesis has been criticized because it is said that its predic- 
tions may be made “on purely empirical grounds.” 45 That is, if we know 
that under some circumstances rest can produce as much gain as practice, 
there must be an intersection point at which two practice curves will 
overlap when plotted according to elapsed time, even though one curve 
is based on fewer trials than the other. While this is true, the fact that 
whole families of curves coincide, even within very wide ranges of dif- 
ference in the spacing of practice and rest, is not so easily predicted. The 
two curves from Dore and Hilgard already presented (Figure 49) show 
how the recovery from the detrimental effects of 3-minute practice periods 
brings the scores to about the level reached when practice and rest are 
very differently spaced. The corresponding scores for the beginning and 
end of each of four days of practice under three widely different con- 
ditions of spacing are shown in Figure 50. 46 Whatever one's theory, it is 
pertinent to note that the gains within each day and the overnight losses 
are remarkably similar despite the accumulated differences in the number 
of practice trials. The ordinary trial-by-trial plot would have shown strik- 
ing advantage for distributed practice without calling attention to a uni- 
formity that no other theory predicts. 

The validation or refutation of a theory is no simple matter. Although 
the results presented conform to the consolidation theory, other results 
from the same laboratory show no such correspondence to theoretical 
predictions. 47 When some predictions from a theory succeed and others 
fail, the failures mean that the theory, if not wrong, is at least incomplete. 

Other suggestions . It is characteristic of the functionalist to invent 

44 Snoddy (1935), Wheeler and Perkins (1932). 

46 McGeoch and Irion ( 1952), page 183. 

46 Similar curves for mirror-vision co-ordination as studied by Snoddy (1945) were 
replotted in the first edition of this book, Hilgard ( 1948), page 256. 

47 Bell (1942), Wright and Taylor (1949). The latter experiment was done with 
verbal learning. 
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ad hoc theories to explain particular findings. This tendency in itself is 
neither good nor bad, although at some stages of knowledge it produces a 
great conglomeration of unrelated theories, and inconsistencies are hard 
to detect. 

We may conclude this treatment of special theories by pointing out 
three suggestions not covered in the foregoing list of special theories. 



FIGURE 50. Similarity in trends with three distributions of practice in pursuit- 
learning. Group A by the end of the fourth day had practiced a total of 32 1-minute 
trials, Group B a total of 52 trials, and Group C a total of 72 trials. Data of Hilgard 
and Smith (1942) as plotted by Underwood (1949), page 406. 

The first of these suggestions is that of Cain and Willey (1939), who 
studied the retention of nonsense syllables following learning to a com- 
mon criterion of mastery by two groups. The massed-practice group 
learned the list in a single day. The distributed-practice group learned 
over three days, each day s practice being interrupted when the subject 
reached a preselected criterion short of mastery, but progressively higher 
each day. By the third day the distributed-practice subjects reached the 
same criterion of mastery that the massed-practice subjects reached in 
one day. The group that learned by distributed practice turned out 'to 
be far superior in retention to the massed-practice group, as tested at 
intervals of 1 day, 3 days and 7 days following learning. Partly because the 
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distributed-practice group showed no advantage over the massed-practice 
group in trials required to learn, Cain and Willey argued from the 
distributed-practice group s superior retention that this group must have 
learned something the massed-practice group had not learned. This was 
to pick up the task on the following day where it had been left off many 
hours before, even when it had been poorly learned. In other words, the 
distributed-practice group had two days of practice at recalling after 
24 hours that the massed-practice group did not have. While this is a 
plausible suggestion within the arrangements of this experiment, it is 
hardly suitable as a general interpretation of the advantages of distributed 
practice. It gives a warning, however, that qualitative changes occur 
when very long intervals are dealt with. It is always necessary to supple- 
ment mere scores by a psychological analysis of the activities going on. 

A second suggestion was originally made and tested by Ericksen 
(1942). He proposed that massing of trials should lead to variability of 
response, through factors similar to the refractory phase that act as 
barriers to repetition. If this is the case, massing should be advantageous 
when the task is one calling for variability of attack. He found some evi- 
dence for this theory for human subjects solving a problem-box. Earlier 
work by Cook (1934) had found massed practice to be an advantage in 
early trials of puzzle-solving, a result coherent with Ericksen s hypothesis. 
A later test by Riley (1952) failed to find benefit from massing in a task 
with many alternatives; distributed practice proved beneficial through- 
out. 

A third suggestion comes from Underwood (1954a), calling attention 
to individual differences as a factor in the effects of distributed practice. 
He found that retention by slow learners tended to suffer from distributed 
practice as shown in Figure 51. While he believes these results to account 
for some of the discrepant findings on the relation between distribution of 
practice and retention, the results do not suffice to account for all the 
discrepancies. The differences between the two kinds of learners are to be 
taken seriously, but the failure to find advantages with distributed practice 
even for the fast learners shows that something else is, operative in these 
experiments. 48 

These suggestions (and they do not exhaust the possibilities) show 
how complex a dimensional approach would become if it were to take 

48 One possibility is that the symbol cancellation task used to fill the interval has an 
interfering effect upon retention, and that this interference is more severe with the 
slow learners. The experiments that find more advantage for distribution have used 
dhiefi^ color naming, shown to be somewhat facilitating as against symbol cancellation. 
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seriously the task of exploring in systematic variation all of the factors 
that have been shown by one or another experiment to be influential in 
experiments on the distribution of practice. 




Concluding Comments on the Distribution of Practice 


An examination of experiments and theories related to distributed 
practice has helped us to see some of the characteristics of functionalism 
at work. First, the general position provides an orientation towards dimen- 
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FIGURE 51. Deleterious effects of distributed practice upon retention for slow 
learners compared with fast learners. Underwood (1954a), page 298. 

sional experimentation. If intervals are important, then we explore an 
orderly series of intervals; if similarity is important, then we find a way 
of grading similarity and we explore the effects of several degrees of 
similarity. Second, however, the attack is always much more specific than 
the general orientation alone would suggest. Not all or just any dimensions 
are explored, for such an approach would be too cumbersome; instead, 
the methods and materials used tend to be restricted largely to familiar 
ones, and the particular dimensions chosen for investigation are usually 
related to one or more of the special theories that are current. The pro- 
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gram is very much that of science in general. It is very similar to that 
proposed by Skinner, to the extent that it is dimensional and seeks lawful- 
ness rather than laws; it is related to Hull in that the tests are commonly 
tests of hypotheses. It lies between these two positions in that the ex- 
perimenters who are functionalists usually have more respect for hy- 
potheses than Skinner has, and they are more cautious than Hull about 
general systematic formulations at this stage of knowledge. 

Does the approach pay off in firm scientific knowledge? The answer, 
at least if we generalize from studies of distributed practice, is by no 
means unequivocal. We are left at present with a welter of special theories, 
mostly tested with undecisive results. What is the matter? 

One embarrassment is the failure of later experimenters to agree in 
their findings with the results of earlier experimenters. If we cannot find 
reproducible experimental data, it is futile to try to reach agreement on 
theoretical predictions. Possibly these failures to repeat are due to slip- 
shod experimental procedures, but this is unlikely. Most experimenters 
are careful in what they do and report their results accurately. A more 
plausible diagnosis of failure to come up with identical findings when 
experiments are repeated can be made along the following lines; 

1. Experiments are seldom repeated without “small” or “trifling” 
changes. Strictly speaking, we are not interested in repeating earlier ex- 
periments precisely , for then we get no new information. Hence we in- 
troduce what we believe to be non-essential changes (such as a different 
task in rest intervals), hoping to reproduce the essential findings. If the 
same findings are indeed the outcome, no problem arises. But if we do not 
find the same results, then we do not know whether or not we would have 
found them had we not made the “trifling” changes. Because of the many 
difficulties in so-called repetition, it is essential that we be patient enough 
to repeat exactly what other experimenters have done before we begin to 
make “small” changes. 49 

2. Experimental designs are permitted to become too complex, with- 
out prior psychological analysis of the processes under investigation. Ex- 
perimental design is not solely a statistical problem. While of course re- 
sults must be secured in a maimer to make possible adequate statistical 
analysis, the subjects task must also be reckoned with. The counterbal- 

49 Those not close to the experimental literature have no idea how serious this non- 
repeatability of experimental results has become, A notable example is the Ward- 
Hovland reminiscence phenomenon, which led Buxton ( 1943 ) to ascribe to it the char- 
acter of “now-you-see-it-now-you-don’t.” Underwood (1953e) found so little evidence 
for it in his review of later studies that he designated it as possibly "a prewar phe- 
nomenon.” 
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ancing of practice effects by using the same subjects in various cycles 
of an experiment is likely to prove not to be an economical procedure. 
Underwood s results as presented in both Figures 48 and 51 show how 
futile it is to try to equate effects at different stages of practice when 
subjects of different ability are used and when different kinds of tasks 
(e.g., paired associates versus serial learning) show very different effects 
early and late in practice. The simpler design, well represented in the 
experiment of Wilson (1949), in which each subject is used in but a 
single condition, gives all the advantages of applicability of modern 
analysis of variance statistics while permitting much better control of the 
psychological processes involved. It has the further advantage that new 
controls can be run without going through an elaborate counterbalanced 
series in which one cannot fathom what confusion has been created for 
the subject. 

These two suggestions are related. It is too costly to repeat exactly 
very involved counterbalanced designs. If simpler designs were used, it 
would be possible to repeat the particular cycle in which the next ex- 
perimenter was interested, in order to show that under the original con- 
ditions the original results could be obtained. Then the experimenter is 
free to make any changes he wants, with some assurance that his changes 
bear upon the modified results that he obtains. 

A second source of embarrassment is that even the special theories 
appear to have a long life and seldom are disposed of by way of experi- 
mental evidence. The only special theory related to distributed practice 
that now seems ready to be dismissed is that of differential forgetting, for 
no experimenter has found the error items that he ought to find if the 
theory is true. The consolidation theory is not usually treated as very 
promising, yet no theory has been proposed to predict concretely the re- 
sults that happen to fit it. The theories based on warm-up and recovery 
from work decrement satisfactorily explain some, but not all, of the re- 
sults. So, too, Gibsons generalization-differentiation theory remains in 
the running with some substantial agreements between data and predic- 
tions, but some failures also. 

It is perhaps to the functionalist s credit that he can tolerate this 
mass of material so loosely integrated, rather than blinding himself to 
some of it in order to make his theories sound more plausible. Yet it 
would be even more to his credit if he could invent a theory that would 
give a coherent account of the whole array of relationships. We may note 
a dilemma here. Special theories can be stated with great precision, and 
so they have that advantage over general theories, at least in early stages 
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of experimentally established fact. But special theories can also be ap- 
plicable more than one at a time. Hence it is almost impossible to find 
a crucial experiment to test a special theory. If the experiment agrees 
with the theory, that is fine, but we all know that you cannot prove 
theories through occasional agreements, but can only keep them in the 
running that way; if the experiment disagrees with the theory, the theory 
may still be true, but something else may be in conflict with it. There is 
no point in giving up any theory with which occasional agreement can 
be found until there is some more general theory that accounts for all 
the agreements. The hypothetico-deductive method is by no means as 
simple as at first blush it appears to be! 

ESTIMATE OF FUNCTIONALISM 

Current functionalism is not a unified theory such as those earlier 
discussed. What bounds of unity there are lie in a tolerant acceptance 
of a wide range of psychological phenomena and a conviction that the 
task before psychology is to subject the many variables to quantitative 
study. Such a methodological unity permits wide diversity in content. 

The Functionalist Position on the Typical Problems 

of Learning 

Within such an unstructured system there is no one clear answer 
to most problems of learning. The answer to be expected is that “it all 
depends on conditions.” In practice the situation is not quite as free as 
this implies, for gradually there accumulate more commonly accepted 
generalizations. 

1. Capacity. Robinson recognized individual and species differences 
in his laws of individual differences and of composition. It is in line with 
functional developments for McGeoch and Irion to include in their book 
a chapter on learning as a function of individual differences. They believe 
that the increase of learning ability with age is best accounted for on the 
basis of two hypotheses: first, organic maturation, second, changing psy- 
chological conditions (transfer, motivation, personality traits). 

2L Practice . The law of frequency was to Robinson a law of relative 
frequency, which therefore recognizes the losses in score when practice 
is overcrowded, along with the gains when trials are more appropriately 
spaced. There is a tendency to emphasize the form of the learning curve 
and to seek the conditions under which one form rather than another is 
to be found. There is, however, no diatribe against a law of exercise. 
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3. Motivation . Woodworth’s dynamic psychology places motivation 
at its core. Carr accepts in principle the preparatory-consummatory se- 
quence, assigning motivation the role of a continuing stimulus to be termi- 
nated by the goal-response. The concept of ‘set” enters into the more con- 
ventional experiments on memorization and skill as a motivational 
supplement to the more familiar laws of association. It was the preoc- 
cupation of the early functionalist with such tasks as the learning of rote 
verbal series which tended to place motivation in the background rather 
than the foreground of theories such as Robinson’s. 

4. Understanding. While the association^ recognizes that meaning- 
ful material is more readily learned than nonsense material, degree of 
meaning is but one of the dimensions upon which materials can be 
scaled. Hence he does not believe that problem-solving or insight require 
interpretations beyond ordinary associative learning. The organism uses 
what it has learned as appropriately as it can in a new situation. If the 
problem cannot be solved by analogy, the behavior has to be varied 
until the initial solution occurs. Insight is perhaps an extreme case of 
transfer of training. 50 

5. Transfer . Following Thorndike, transfer falls chiefly under the 
law of assimilation. That is, transfer depends upon degree of likeness 
between the new situation and the old. Woodworth reinterprets the 
theory of identical elements to mean only that transfer is always of con- 
crete performances. 

What the theory of identical elements demands is that transfer should be 
of concrete performances, whether simple or complex makes no difference to the 
theory. 51 . . . Perhaps anything that can be learned can be transferred. But 
does not everything that can be learned have the concrete character of an act or 
way of acting? 52 

6. Forgetting. The favorite theory of forgetting is that of retroactive 
inhibition, but the functionalist does not insist that this is the whole 
story. There may be some forgetting according to passive decay through 
disuse, and there may be forgetting through repression, as pointed out 
by Freud. 53 

Has Functionalism a Systematic Theory of Learning? 

Functionalism is empiricist rather than systematic. It eschews in- 
ference for established experimental relationships between demonstrable 

50 McGeoch and Irion (1952), page 53. 

51 Woodworth ( 1938), page 177. 

62 Woodworth (1938), page 207. 

53 Robinson (1932b), pages 113-118. 
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variables. Its laws are quantitative, directly descriptive of data. There 
is a healthy respect for data, and there is a commendable urge to state 
issues specifically in a form subject to test. The relativism brings with it 
a freedom from bigotry. Before his untimely death, Robinson had turned 
to social problems, with the conviction that the same methods would 
work there. So long as people were forced to think in terms of specifics 
he believed that they could often reach agreement, even though on 
larger issues they were swayed by prejudices and preconceptions. 54 

The disadvantage of an extreme empiricism and relativism lies in its 
lack of articulating principles to cut across empirical laws. What results 
is a collection of many “laws,” without hierarchial structure. There is no 
economical multidimensional apparatus for fitting together the various 
two-dimensional functional relationships, each of which is necessarily 
cast in the form: “other things being equal.” If the dimensional program 
of the functionalists were fulfilled we should have a large handbook of 
data, with each of the several laws illustrated by a number of graphs 
showing the variations of associative strength under specific conditions. 
Empirical multidimensionality would be achieved through experimental 
designs testing several variables at once, but unless some simplifying 
steps were taken, the possible combinations and permutations of con- 
ditions would mount fabulously. Dimensional analysis puts data in 
order for exposition and for verification, but in itself does not connect 
the data into an economical scientific system. Such a system has to be 
logically structured as well as empirically sound. 

Functionalism is in some respects an eclectic position. It is eclectic 
on the problem of introspection versus behavioral description, by ac- 
cepting both in the account of psychological activity. A favorite illustra- 
tion is that less attention to details is required after a skill is mastered. 
Knowledge about the representation of details in awareness requires an 
introspective report, while the mastery of the skill may be studied by 
observation of the overt movements. Functionalism is eclectic on the 
problem of blind versus intelligent learning, accepting a continuum be- 
tween these extremes, according to the dimensional analysis of Mc- 
Geoch and Melton. Any point of view which is pragmatic and plur alis tic 
can easily incorporate concepts from alien systems; hence functionalism 
is well suited to play the mediating role. Functionalism is, however, an 
eclecticism with a bias — the bias of assoeiationism in favor of analytic 
units, historical causation, environmentalism. The bias is more evident in 
Chicago functionalism than in that of Woodworth, who has shown him- 
54 See, for example, Robinson ( 1935). 
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self friendlier to less associationistic concepts. New data may be ac- 
cepted but forced back into older concepts not fully appropriate. Thus 
some of the novelty of the observations made in insight experiments is 
lost when insight is treated as merely another illustration of the familiar 
transfer of training experiment. Similarly, what goes on in the reason- 
ing process is not fully encompassed by treating reasoning as trial-and- 
error learning merely a little farther out on the dimension of explicit- 
implicit response. Those who disagree with the associationist position 
object to the functionalist's incorporating of new experimental findings 
without accepting the theoretical implications of these findings. This 
is a problem with which the eclectic always has to wrestle: how much 
can be taken over without incorporating its systematic context? The 
functionalist has not been greatly concerned about the inner consistency 
of his borrowings because he is less concerned than others about the 
inner consistency of his own concepts. What consistency there has been 
is provided by the framework of associationism within which the func- 
tionalist works. 

Perhaps Meltons position is the best to take on the matter of func- 
tionalism's systematic position: the functionalist does not have an articu- 
lated system, but he would like to have one, and is not anti-theoretical. 
His breadth and self-criticism prevent his embracing a comprehensive 
system until facts are better ordered. 

Whatever the systematic limitations may be, it must be recognized 
that great energy for experimental study has been released within the 
functionalist group. The books of McGeoch and Irion (1952), Osgood 
(1953), Woodworth and Schlosberg (1954) amply attest to this. Out of 
the large number of research investigations on memory and skill, done 
largely within this group, there has come a rich body of data and factual 
relationships with which anyone interested in learning must be familiar. 

Convergence Toward Functionalist Interpretations 

Because there is so much room within functionalism, because it is 
an experimentalism, because it fits the American temper, it provides a 
kind of framework appropriate to much of American psychology. Many 
of the trends in contemporary theory and experiment, not explicitly re- 
lated to functionalism as a school of psychology, nevertheless are in 
accord with its historical orientation. 

1. The scientific logic of operationism , whereby scientific facts and 
concepts are related to the concrete operations through which they are 
produced, has been widely espoused by psychologists of most divergent 
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theoretical backgrounds.* 5 To the functionalist, operationism is a very 
natural development from James’ pragmatism and Deweys instrumental- 
ism. 

2. Mathematical models that are “neutral” with respect to many 
historical controversies are essentially “functional” models, in the mathe- 
matical sense of function. This mathematical sense of function is coherent 
with the psychological meaning of function. Hence it is not surprising to 
find Skinner (1953a) writing about the need for a functional analysis 
(meaning essentially a dimensional analysis), or to find Brunswik (1955) 
describing his system as a “probabilistic functionalism.” The mathe- 
matical models to be described in the next chapter can properly be de- 
scribed as functional ones. 

3. Miniature systems, in which restricted realms of data are sum- 
marized according to a few special theories, are also congenial to the 
functionalist. There is a heuristic pluralism involved in functionalism, 
not unrelated to James’ pluralism. That is, in order to move forward in 
the interpretation of data it is sometimes necessary to accept provisional 
(heuristic) interpretations that permit you to move ahead with the analy- 
sis. A case in point is VTE (“vicarious trial and error”) as proposed by 
Muenzinger and followed up by Tolman. This rather limited segment 
of behavior (vacillation at the point of choice) is given a name and 
its conditions studied. The many systematic questions ( Is it vicarious? Is 
it trial and error? ) can be postponed until some of the functional rela- 
tionships are explored. At this stage there is a pluralism, because there 
exist ad hoc concepts not related by definition or equations to the funda- 
mentals of an established system. 

Contemporary psychology, to the extent that it is increasingly op- 
erational, and increasingly concerned with mathematical models and 
miniature systems, reflects the general outlook that, in one form or an- 
other, has all along characterized the functionalist. 
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Chapter 11 

THE EMERGENCE OF 
MATHEMATICAL MODELS 


S CIENTIFIC description depends upon accurate observation. To make 
observations more precise, a scientist commonly uses recording instru- 
ments and then makes measurements with the aid of them. He records his 
measurements in numerical form, whether by counting, or ranking, or 
scaling. As he treats his data in order to discover lawful relationships 
among them he naturally turns to mathematics as the language designed 
to state relationships among the kinds of events to which numbers can be 
assigned. This is all so familiar, especially in the physical sciences, that 
mathematical expression for scientific relationships is taken for granted. 

The use of mathematics in the biological and social sciences is neither 
so widespread nor so successful up to the present time as it has been in 
the physical sciences. Hence we cannot be sure that all biologists or all 
social scientists would today assent to the proposition that ultimately their 
sciences must state their laws in mathematical form. We need not seek 
agreement on this proposition, however, in order to study such progress 
as there has been in the use of mathematical models in these fields. 

Learning data are well adapted to mathematical treatment. Relation- 
ships expressed in the form of learning curves or retention curves can 
also be expressed by equations. Because the quantitative effects of a num- 
ber of variables are known, further equations can be written. We have 
already become acquainted with one system in which mathematical ex- 
pression has been used throughout (that of Hull, in Chapter 5), and an- 
other in which an adaptation of mathematical ideas takes a central place 
(that of Lewin, in Chapter 8). We are ready, therefore, to review the 
place of mathematics in the study of learning, and to note particularly 
some recent developments. 
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EMPIRICAL AND RATIONAL CURVE-FITTING 

When Ebbinghaus (1885) found that his curve of retention was 
roughly logarithmic in form, he fitted a logarithmic equation to it: 

100k 

(log t) c k (1) 

where b = per cent retained by the savings measure 
t — elapsed time 

c, k = arbitrary constants obtained by curve-fitting 

Thus the very first quantitative study of associative learning intro- 
duced mathematical formulation. 

Ebbinghaus used the procedure that has since come to be known as 
empirical curve-fitting. When we follow this procedure, we first plot the 
experimental data and then look for some kind of mathematical function 
likely to fit it. This function represents a family of curves, say of hyperbolas, 
or parabolas, a growth function, or what not. Then we go through the 
added step of determining the constants (corresponding to the c and k of 
Ebbinghaus’ formula) in order to find that particular member of the 
curve family that comes nearest to fitting the obtained data. The com- 
monest method for doing this is known as the method of least squares. 
What this means is that we seek that member of the curve family that 
does as well as possible in passing near our experimentally determined 
points, the measure of success being the sum of the squares of the devia- 
tions of the curve from our data. We try to make this sum a minimum, 
hence the name ‘least squares/’ If we can find another curve family whose 
best fitting member produces a smaller sum of squared deviations, this 
second curve is a better fit, and is generally to be preferred. 

The main point in empirical curve-fitting is that we select the curve 
family solely on the basis of fit, and not on the basis of any theory. Ebbing- 
haus set out with no assumptions and required no theory that forgetting 
ought to follow a logarithmic curve; such a function merely proved to be 
a convenient summary expression of the facts he discovered. Life insur- 
ance companies predict the likelihood of survival on the basis of such 
empirical functions; they call them actuarial curves because they are 
based upon actual recorded facts. There need be no theory whatever as 
to why older people are more likely to die than younger ones. 

When rational curve-fitting is contrasted with empirical curve-fitting, 
the word “rational” implies that the family of curves is chosen according 
to some theory or theoretical deduction so that it is logical to expect the 
curve as plotted to be of a given form. Rational curves, because they arise 



370 THEORIES OF LEARNING 

from and test theories, are more elegant than empirical ones. A rational 
curve of slightly poorer fit may turn out to be more useful for purposes of 
science than a better-fitting empirical curve. The step of curve-fitting 
has to be taken with rational curves as well as with empirical ones. That 
is, while the form of the curve is determined by theory, the particular 
member of the curve family that fits best is determined by the method 
of least squares, exactly as in empirical curve-fitting. Thus the constants 
(or parameters , as they are called) of the rational equation are deter- 
mined empirically. The rational theory demands either that these con- 
stants should be interchangeable from one experiment to another or that 
they should vary in some predictable manner. Hence the theory upon 
which the rational equation is based can be tested by the agreements or 
disagreements among the constants as determined empirically in differ- 
ent experimental settings. These constants, furthermore, have reference 
to variables with additional (non-mathematical) meanings. 

Empirical Learning Curves 

Ebbinghaus’ lead was followed some twenty years later, in 1907 and 
1908, by attempts to state the learning curve in mathematical form. 1 
These early starts were followed from time to time by others as more 
men became interested in the problem. 2 

As an example of the empirical approach, let us consider some sug- 
gestions first made by Culler in 1928, then elaborated later by Culler 
and Girden (1951). Arguing that the complete learning curve must be 
S-shaped, that is, a curve of increasing gains at first, later becoming one 
of decreasing gains as the upper limit is approached. Culler and Girden 
compare the fit of the normal ogive with that of the equation for mono- 
molecular autocatalytic functions. The normal ogive, is of course, the 
familiar integral of the normal probability curve. The mono 7 molecular 
autocatalytic function yields a very similar curve. When written ap- 
propriate to learning, its equation is: 

_ be Ax 

y ~~ c 4 * e Ax ( 2 ) 

where y = a measure of learning 
x = a measure of practice 
A = a constant for the learner and the task 
b = a limit of attainment 
c s= a constant of integration 
e = the base for natural logarithms 

1 Schiikarew (1907), Robertson (1908). See also Robertson (1923). 

2 For an able review to that date, see Gulliksen ( 1934 ) . 
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This function, 3 as in the case of other empirical curves, does provide 
some hints as to what the learning process may be like. The particular 
mathematical expression implies that the gain per trial is proportional to 
the product of the amount already learned and the amount remaining to 
be learned before the limit of learning is reached. By proper choice of 
constants, this will yield a curve of increasing gains at first, later be- 
coming a curve of decreasing gains. Population growth can be expressed 
by such a curve; 4 the conjecture is inviting that some self-limiting growth 
process may underlie learning. As matters stand, however, the curve is 
not selected because of any formal theory that learning is such a process. 
The curve is chosen merely because it fits. 
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FIGURE 52. The fit of an ogive to learning data. The data were obtained by 
Culler from the learning of three-place symbols. Culler and Girden (1951), page $34. 

Culler and Girden test both equations for a great many learning data, 
and find that both fit quite well. Some illustrative curves are shown in 
Figure 52. These are fitted with an ogive; the authors conclude that 
empirical considerations alone do not permit a choice to be made be- 
tween the two curve forms that they tested. Culler and Girden are well 
aware that empirical curve-fitting is not enough, and that the choice of 
learning function must ultimately be made upon the basis of theory. 
They point out how futile it would be to obtain better-fitting curves 
through merely adding more parameters to an equation. 5 

3 The mono-molecular autocatalytic reaction was first proposed as appropriate for 
learning by Robertson ( 1908). The statement in Equation (2) is as given by Gulliksen 
(1934). 

4 Pearl and Reed ( 1920) . 

5 Culler and Girden ( 1951 ) , page 328. 
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When additional parameters are used, empirical results can, of 
course, be very closely approximated. The formula proposed for general 
use by Woodrow (1942), for example, contained five parametric con- 
stants: 

y~a 4- Vp 2 + k 2 - (l - — f»+**)* (3) 

His demonstration that he could fit it to a wide range of data did 
not make it survive as a general psychological law at all; it turned out to 
fit relationships of so many forms that it lost its theoretical interest. More 
than empirical curve-fitting is required if mathematics is to serve psycho- 
logical theory. 

While the deductive structure of Hull’s theory is rational in form, 
the basic curve of learning that he used did not rest upon any of the 
fundamental postulates about the way in which learning takes place. 
Hence his curve belongs with the empirical curves. The form Hull used 
can be stated as follows: 

bH e = M(1-«-») (4) 

where S H E = habit strength 

M = upper limit of habit strength 
t =z trials 

k = a constant expressing learning rate 
e = the base of natural logarithms 

Such a curve (as in the case of the autocatalytic one) can be given 
a meaning that suggests some sort of theory. Hull’s formula implies, for 
example, that the gain per trial decreases with practice throughout learn- 
ing, and that the gain per trial is 'proportional to the amount remaining 
to be learned before the limit of learning is reached. But Hull did not 
state why this should be so. 

Rational Learning Curves 

An advance in the application of mathematical thinking to learning 
was made by Thurstone ( 1930a ) ( 1930b ) when he based his equations 
on a theory as to how learning takes place. The psychological assump- 
tions are not very complex, and he did not pay any attention to the mecha- 
nisms underlying learning. What happens, according to Thurstone, is that 
the animal (or other learner) performs a number of acts per unit of 
time. His attainment can be stated as the probability that one of these 
acts will be successful. Thurstone introduced law-of-effect assumptions 
(without calling them that), for he proposed that successful performances 
led to further successes with practice, and unsuccessful performances 
led to a dropping out of errors. 
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If the learner at any one time has available to him s successful acts 
and e erroneous ones, his probability of success is simply the proportion 
of successful acts: 


s 

P ~s + e (5) 

Similarly, the probability of failure will be: 


<? = 1 -f = fT7 < 6 > 

If learning takes place with practice, we may write a differential 
equation to describe the increase in probability of success with time, in- 
troducing the constant k to describe the learning rate: 

ds , _ ks 

dt~ P ~~s + e (7) 

Error elimination can be expressed similarly, the negative sign in- 
dicating that errors are reduced with practice: 


de _ 
dt ~ 




-ke 
s + e 


( 8 ) 


Another learning constant is required, reflecting the difficulty or 
complexity of the task. Thurstone shows by computations based on equa- 
tions (7) and (8) that the following equation must hold: 


m = s x e (9) 

That is, the product of errors and successes available at one time 
will be a constant (m). When the task is complex, this product will be 
large; when the task is simple, this product will be small. 

Without any further psychological assumptions, Thurstone is able 
to perform an integration yielding the following curve of learning: 


2p — 1 kt 

— - — = — rr 4 - 3 

Vp — p 2 V m 

where p = probability of success per unit of time 
(the measure of attainment) 
k = constant reflecting learning ability 
m = constant reflecting task complexity 
t = time in the units over which probability is measured 
z = constant of integration 


( 10 ) 


By strict mathematical transformations, Thurstone (1930b) was able 
to make some predictions from his equation, in addition to fitting ordinary 
learning curves. He predicted, for example, the relationship between 
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length of task and learning time under varied circumstances and found a 
satisfactory relationship between his prediction and existing data. 

Gulliksen (1934) generalized Thurstones theory another step and 
arrived at another rational equation that proved quite satisfactory in 
fitting learning data (Figure 53). The constants in his final equation in- 
cluded the initial strengths of correct and incorrect responses, a con- 
stant for the strength added by repeating and rewarding a correct 
response, and a constant for the strength subtracted by repeating and 
punishing an incorrect response. Later Gulliksen and Wolfle (1938) ex- 



FIGURE 53. Success of Gulliksen’s equation in fitting data from individual 
rats trained in visual discrimination by Lashley’s jumping technique. Gulliksen 
(1934), page 420. 

tended the equation to cover discrimination learning and subjected it to 
experimental test. The papers of Thurstone, Gulliksen, and Gulliksen and 
Wolfle represent a kind of succession, in which each equation is some- 
what more general than the preceding one, and the earlier equations can 
be considered special cases of the later ones. 

This is about where the matter was left before World War II. 
There were other developments in the mathematics of learning, particu- 
larly by Rashevsky and his students and colleagues, but psychologists had 
not shown themselves ready to pay much heed to them. 6 The painstaking 
volume by Hull and his collaborators entitled “Mathematico-deductive 
theory of rote learning” appeared in 1940, calling strongly to the attention 
of the psychological world the possibility of a forward step in rigorous 
theorizing, including not only derivations in ordinary mathematics but in 

6 Rashevsky had dealt mathematically with learning problems in Amencan psycho- 
logical journals as early as 1931, and his 1938 volume on Mathematical biophysics 
contained chapters on conditioned reflexes, the- discrimination of relations, error 
elimination, rational learning and thinking. 
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symbolic logic as well. The stage was well set for new mathematical ap- 
proaches after the disruptions of the war years, based, to be sure, in part 
upon developments that wartime demands brought about 

SOME MODELS FROM OTHER FIELDS POTENTIALLY 
USEFUL FOR LEARNING THEORY 

In the last few years it has become customary to refer to some kinds 
of formal scientific theories as corresponding to one or another "model.” 
The notion of making a model to represent a theory is a very old one. 
Most of us have seen colored balls arranged on wires to represent the 
atoms in a molecule or crystal, though we know very well that the 
molecule under a microscope would not look like that. Yet the model 
does represent certain relationships. Or in describing the operation of 
the Organ of Corti and the basilar membrane we may have learned of 
a hydraulic model. Helmholtz" resonance theory used a harp as a model. 
The model represents a series of relationships — mathematical, physical, 
conceptual — which appear to be appropriate for the understanding of 
some realm of data. There must be a kind of "fit” between the data and 
the model. The adequacy of the model can be judged by its success in 
ordering data, and in making verifiable predictions from the data. 

The more formally a model is worked out, the more likely it is to be 
a mathematical one. A mathematical model consists of a set of relation- 
ships and transformations that can be made consistently according to the 
definitions and rules of the particular kind of mathematics. Data of many 
kinds and from many fields of science may be treated according to the 
rules of any one kind of mathematics. Thus the same rules of differentia- 
tion and integration apply within calculus whether one is investigating 
the action of an electronic circuit, an airplane wing under stress, the 
transmission of a nervous impulse or the behavior of a crowd. 

While a mathematical model can therefore be used with a variety of 
data, models tend to be developed in close conjunction with the solution 
of particular kinds of problems, and they show some of the history of 
their origins in the form they take. Whether or not these models will 
prove useful within learning theory may depend upon some analogies 
between the substantive problems in the field in which the model de- 
veloped and in the field of learning. 

Three types of model from other fields are worth mentioning here: 
the feedback model, the information-theory model, and the theory-of- 
games model. 
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The Feedback Model 

The science of cybernetics was christened by Wiener (1948), who 
encouraged the application of some principles coming from communi- 
cation engineering to physiological and social problems. The word 
"cybernetics” comes from a Greek word meaning steersman or governor, 
and cybernetics can be thought of as the study of control processes in 
machines, in organisms, in social groups. It is a mechanism of control 
that provides the model for these various activities. 

One of the central concepts of cybernetics, "feedback,” has gained 
wide currency in psychological discussion. Let us examine the analogy 
between adaptive behavior and a thermostatic control system. When the 
heat comes on and warms the room the thermostat records the rise in 
temperature. When the temperature reaches a critical point, the thermo- 
stat sends a "message” back to the furnace and turns off the heat. The 
furnace is thus controlled by a "feedback” from the consequences of the 
furnace’s operation. A man walking down a path has a "feedback” from 
his muscles, tendons, and joints, so that he makes corrective movements 
if the ground is uneven or if his foot strikes an obstacle. The awkward 
gait in tabes dorsalis arises because part of this feedback is missing, and 
the placing of the feet has to be controlled visually — thus using a less 
satisfactory feedback device. 

Is feedback a possible model for learning theory? Consider trial-and- 
error learning, especially in the form of approximation-and-correction. 
What the learner does on successive trials is regulated by the conse- 
quences of his activity. The old problem of the circularity of the law of 
effect can perhaps be resolved in terms of feedback. 

Another name for the feedback mechanism is servomechanism , and 
a large literature is now available, much of it expressed in mathematical 
form. 7 

The feedback model as such has thus far been little used in learning 
theory. Ellson (1949) showed how some of the input-output formula- 
tions of operational analysis might prove useful in the psychology of 
learning, but he failed to combine the feedback concept with the input- 
output one. Leavitt and Mueller (1951) show how the old problem of 
knowledge of results can be treated in the language of "feedback.” The 
instructor tries to communicate to a student a geometrical form, using 
only words. Success is much greater if the student is free to ask questions 
and to receive answers in the form of "feedback.” This seems fairly ob- 
7 For example, MacColl ( 1945); Truxal ( 1955). 
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vious, yet there are some interesting possibilities in quantifying the 
amount of feedback and its consequences for learning. 

The Information-Theory Model 

Both the feedback model and the information-theory model have 
their origins in the study of communication. Yet information-theory has 
had a somewhat distinctive formulation, with a clear influence upon 
psychology. 8 

The communication model suggests that psychological events can 
be understood through analogy with the events that occur when a mes- 
sage is transmitted through an electronic transmission system (e.g., a 
long-distance teletype system). The communication paradigm developed 
by Shannon 9 is reproduced in Figure 54. 



FIGURE 54. Model for a communication system* After Shannon. Reproduced 
from Grant, 1954, page 63. 

Information in the form of a message enters at the source and is 
converted to a code suitable for the given transmission system. A tele- 
type machine converts the typist's strokes into patterns of dots and dashes 
suitable for sending as electrical impulses. These go through the com- 
munication channel as electrical impulses, or perhaps as radio waves 
or light waves. They have to be decoded at the receiving end, and ap- 
pear as letters typed by the teletype receiving machine at the other end. 
Noise (static, etc.) may enter the message in the process of transmis- 
sion, and the message may be somewhat garbled before it is typed at the 
destination. In military practice a special coding process may be intro- 
duced, so that the ordinary decoding done by the teletype machine will 
not suffice, and the original message can be yielded only if the code is 
known (or is broken by a cryptoanalyst). 

In what respects are psychological events similar to the events in 
such a communication system? One way of picturing them is shown in 

8 The standard work on information-theory is Shannon and Weaver (1949). Useful 
introductions for psychologists are the papers by Miller ( 1953) and Grant ( 1954). 

9 See Shannon and Weaver ( 1949). 
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Figure 55. The organism here corresponds to the communication chan- 
nel, transmitting information to the psychologist by way of its behavior. 
The observable responses make their appearance only as complex re- 
sultants of prior conditions as these are “encoded” (perceived? reacted 
to?) by the organism. The responses may be modified by non-essential 
circumstances (here “biosocial noise”), so that the observing psycholo- 
gist has a difficult task in interpreting (“decoding”) the subject’s be- 
havior. This is but one way of using the communication model to de- 
scribe psychological events, and other analogies are permissible. For 
example, the conversation between two people can be described accord- 
ing to the successive processes of encoding, transmitting, and decoding 
messages. 10 



FIGURE 55. Communication model in which the organism is conceived as 
emitting signals to be received or interpreted by the observing psychologist. From 
Grant, 1954, page 65. 


The service to psychology of the communication model has thus far 
come largely through some special considerations relating to information 
in its quantitative aspects. 

Information is, of course, a very familiar concept. We receive in- 
formation when we get our questions answered; we use information when 
we find our way about; the dictionary, the telephone directory, the en- 
cyclopedia, are full of information. But when we turn aside from the 
rich substance of information to consider only amount of information, 
and to find a unit by which to describe the quantity of information, we 
enter the province of information-theory. The basic notion is that in- 
formation is discriminative; it answers the question: “Which of a num- 
ber of alternatives is this?” The more alternatives there are, the more 
information is conveyed by the answer. If you give a person’s name you 
give more Information about him than if you give merely his sex, because 
through his name you have identified him among many alternatives, while 
by giving his sex you have identified him among only two alternatives. 

10 For parallels between physical communication systems and various problems of 
human intercommunication, see Schramm ( 1954), pages 3-26. 
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The relation of information to number of alternatives is given more 
formal expression: the amount of information yielded by specifying one 
of a number of alternatives increases as the logarithm of the number of 
alternatives. 

Where does this logarithmic relationship come from? The simplest 
way to look at it is that we can communicate information in a formal 
sense by reducing alternatives one half at a time. Suppose you were 
trying to guess a number between 1 and 32 and can ask questions an- 
swered only by “yes” or “no.” The most information you could get from 
your first question by an orderly approach would be whether or not the 
number was 16 or less, or above 16, thus reducing your alternatives by 
half. By continuing this process, you would arrive at your number. How 
many questions would you have to ask? It is not hard to prepare a little 
table to see how this works: 

Range Within Which Number of Guesses 

Number Lies Required 

1-2 ... 1 

1-4 2 

1-8 3 

1-16 4 

1-32 x 5 

We may say that specifying a number from among a larger set of 
numbers gives more information than specifying a number from among 
a smaller set. The relationship between our two columns can be ex- 
pressed as follows: 

Number of guesses required = log 2 number of alternatives 

This is the model used for information theory. The unit of informa- 
tion is called the “bit,” which is a clever adoption of a familiar way of 
talking about small amounts of information, combined with a conden- 
sation of “binary digit.” The “bit” of information is the unit of informa- 
tion gained whenever the number of alternatives is reduced by one-half. 
If a message reduces the number of alternatives k to some fraction k/x, 
then the amount of information in the message is log 2 x bits. 

Information-theory deals with such problems as the disturbance of 
a message as it goes through a transmission system. The system con- 
tains random interference (“noise”) which requires that the message have 
some “redundancy” in order that it may be understood at the end. The 
English language is said to be about 50 per cent redundant That permits 
a message to be somewhat garbled as it comes from the telegraph office 
and still be understood. Because of noise the amount of unavailable in- 
formation at the end of a transmission system is greater than at the be- 
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ginning, suggesting a parallel to the physical notion of entropy . In the 
transformation of energy, some is always dissipated as heat, and no 
machine is 100 per cent efficient. The second law of thermodynamics 
states this principle as an increase in entropy. So, too, in communication 
some available information tends to be lost and unavailable informa- 
tion increases. 

Because learning involves input and output, with processes of dis- 
crimination, variability, 11 and interference between stimulus and re- 
sponse, the information-theory model might presumably find useful ap- 
plications. This has indeed proved to be the case. 

There is a familiar demonstration experiment in psychology in which 
we show how much easier it is to learn a list of related words than a list 
of unrelated ones, and how, in turn, the unrelated words are easier to 
learn than a list of nonsense syllables. While the results may be stated 
quantitatively (showing a statistically significant difference between 
each pair of lists) the differences between the lists, as materials to be 
memorized, remain qualitative (* related,” "unrelated,” "nonsense”). 
Here information-theory comes to our help because it is possible to 
show on the basis of the statistical characteristics of the English lan- 
guage the amount of information that has to be acquired in order to 
learn each list. A familiar word conforms to English language practice 
in following one letter by another in non-random fashion. (An extreme 
illustration is always following q by u } but a t is also followed by an h 
beyond chance, a / seldom followed by a c, and so on.) Nonsense syl- 
lables are constructed out of more nearly random sequences of letters, so 
that more information has to be acquired in learning them than in learn- 
ing meaningful words. Word sequences also have their probabilities in 
English, so that less information has to be acquired to learn "related” 
words than to learn "unrelated” ones. This is particularly true if the re- 
lated words form a sentence, thus capitalizing on non-random features of 
familiar sentence construction in English. 

It would be difficult to take the typical demonstration experiment 
as ordinarily used and make the necessary computations to assign diffi- 
culty in terms of the amount of information that has to be acquired to 
learn each of the three kinds of lists, but, in principle, it could be done. 

11 Information measurement can be interpreted in terms of variance and covariance. 
Large variance corresponds to a large amount of unassimilated information (i.e., ig- 
norance). Covariance or correlation corresponds to the relationship between input 
and output, that is, to the amount of transmitted information. These points are made 
by G. A. Miller (1955). 
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Closely related experiments, however, have been designed and per- 
formed. 

Miller and Selfridge (1950), for example, constructed materials with 
different degrees of approximation to the statistical structure of English. 
A random selection of words from the Thorndike-Lorge list of the 30,000 
commonest words provided the “sentence” most unlike English. This they 
called the zero-order approximation to English. The next, or first-order 
approximation, was a random selection from the words produced by the 
subjects in the higher-order sentences about to be described. Then fol- 
lowed a series of approximations each constructed in similar fashion. The 
next approximation (second-order) was obtained by constructing a “sen- 
tence” one word at a time by giving the first word to one subject who 
responded with a single word, then giving his one-word response to an- 
other subject who added one more word, and so on until the “sentence” 
was constructed. This procedure goes beyond the purely random list in 
preserving some of the word-association sequences of English. The next 
order of approximation gave as the context for response the two preced- 
ing words in the sentence. The procedure was followed through 7-word 
sequences. Finally, the authors selected a passage from current fiction or 
biography as a meaningful English text to determine how close the 
higher orders of approximation were coming to “meaningful” material. 
Results for passages of different length constructed in this way are shown 
in Figure 56. It is quite clear that as the passages come nearer to the 
structure of English they are more readily recalled. To a rough approxi- 
mation, the amount of information recalled was constant. Hence diffi- 
culty was in reality not a function of “meaningfulness” per se, but of 
quantity of information that had to be acquired. 

Inspection of Figure 56 shows that a 50-word passage at the fifth 
order of approximation is learned as readily as a fully “meaningful” pas- 
sage of text. Are we to assume then that the fifth order of approximation 
is really meaningful text? Here is an illustration of such a passage: 

House to ask for is to earn our living by working towards a goal for his team 
in old New-York was a wonderful place wasn’t it even pleasant to talk about and 
laugh hard when he tells lies he should not tell me the reason why you are is 
evident. 12 

It appears that there are kinds of nonsense as easy to recall as pas- 
sages taken from a novel. The nonsense is easy if it preserves the learned 
structure of the language, even though it remains nonsense. The struc- 
12 Miller and Selfridge ( 1950), pages 182-183. 
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ture can be treated quantitatively according to information-theory. Thus 
information-theory provides a powerful new analytic tool. 

Hovland (1952b) has furnished another illustration of the usefulness 
of the information-theory approach. He reanalyzed the concept forma- 
tion studies by Smoke (1932) (1933). Smoke had found that in the ac- 
quiring of concepts through presenting various embodiments of the 
concept along with irrelevant cues, "positive instances” in which the es- 
sential characteristics of the concept were included facilitated learning 



FIGURE 56. Amount retained after equivalent practice with materials of differ- 
ent degrees of approximation to English. Miller and Selfridge (1950), page 181. 

much more than "negative instances” in which some characteristics es- 
sential to the concept were lacking. Hovland questioned the generality 
of Smoke’s conclusion because of two possibilities: subjects might be 
reacting to differences in information in positive and negative instances, 
or they might for some reason find it more difficult to assimilate the in- 
formation given by negative instances. If the second possibility is the 
true one. Smoke’s conclusion would be acceptable as an important gen- 
eralization about concept formation. If, however, the first possibility is 
correct, and negative instances furnish less information than positive 
ones, stimulus presentation would have to be blamed for the finding, and 
the generalization about negative and positive instances would have 
little psychological meaning. 

It should be noted in passing that these questions can also be raised 
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about some cases testing the positive and negative aspects of the law 
of effect, in which verbal * punishment” was sometimes found less effec- 
tive than verbal "reward.” It will usually turn out that saying Wrong 
eliminates fewer possibilities than saying Right . In that event the gen- 
eralization about the asymmetry between reward and punishment takes 
on new meaning. 

Hovland’s analysis of the amount of information conveyed by posi- 
tive and negative instances in the concept experiment shows that the 
amount varies greatly. In general, as the total number of possible re- 
sponses increases, the minimum number of negative instances needed 
to convey the concept increases, even when the necessary number of posi- 
tive instances remains constant. In the most extreme case, a minimum of 
two positive instances sufficed to convey the concept, while 625 negative 
instances were required! 

The experimental test conducted by Hovland and Weiss (1953) 
showed that human subjects do not behave as a machine would which 
was built according to the laws of information-theory. Even when the 
amount of information yielded by positive and negative instances was 
equated, they learned better from positive instances than from negative 
ones. While thus showing the simple information-theory model not suf- 
ficient to account for concept learning, there was some gain in having 
equated the information for positive and negative instances. There was 
at least some learning from negative instances, a result Smoke had not 
found. 

It is evident that in the studies by Miller and Selfridge (1950) and 
of Hovland ( 1952b), information-theory has provided a useful tool. Many 
other studies are currently appearing. They will help to define the range 
of this usefulness. 13 

The Theory-of -Games Model 

The appearance in 1944 of von Neumann and Morgensterns Theory 
of games and economic behavior marked the beginning of a very rapid 
development, especially among economists, of a type of model of po- 
tential usefulness in other fields of social science, including psychology. 
The mathematics employed leans heavily upon concepts originating in 
set theory and linear geometry, branches of mathematics until recently 
little studied by social scientists. 

18 Thus far quantitative aspects of information (“bits”) have been used almost 
exclusively in the description of the stimulus. The sequential aspects of communica- 
tion, including the interesting problems of encoding and decoding, are just beginning 
to be brought into the psychological laboratory. 
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As its name implies, the theory attempts to define the optimum 
strategy in competitive games, such as simplified poker games or modi- 
fied chess games. A player makes his moves against uncertainty, either the 
uncertainties of chance events (such as the order of the cards in a 
well-shuffled deck), or the uncertainties of the moves of the opponent. 
The game provides a simplified model of economic behavior, in which 
the decisions of an entrepreneur are made according to his predictions 
of an uncertain market and the moves of his competitors. The original 
theory-of-games model assumes that the players are thoroughly rational, 
and that they always make the best decision that the available informa- 
tion allows. In the game of tick-tack-toe, for example, it is possible to 
master the rational strategy, so that the outcome of the game is predict- 
able for expert players. But we know that in many instances the rational 
course is not chosen. We shall meet an illustration on page 390, in which 
it will be pointed out that in guessing against a 75:25 odds, subjects com- 
monly guess in the 75:25 pattern, while theory-of-games would easily 
show that hits would be maximized by always guessing on the 75 side, 
that is, guessing in a 100:0 pattern. 

In addition to the estimation of probabilities of outcome, the in- 
dividual playing a game prefers one outcome to another. In the language 
of theory of games, one outcome has higher utility than another. In other 
words, decisions are based upon value systems as well as upon prediction 
systems. 1 * Utility and value are essentially motivational terms, and hence 
relevant to psychology. Some empirical investigations have been made 
to translate assumed utility systems into measures of utility as expressed 
through individual behavior. 15 

A game usually involves more than one player, so that the theory 
of games tends to be applied to group decisions, although, of course, 
individual choice is involved. 16 Some learning theorists, however, have 
attempted to bring game-theory into relationship with individual learn- 
ing. A relevant example is that of Flood (1954) who entitles his paper 
“On game-learning theory and some decision-making experiments/' 

Flood attempts to show how a player can learn to improve his 
strategy during a sequence of plays. He accepts basic assumptions from 
both the von Neumann-Morgenstern game theory and the Bush-Mosteller 
learning theory to be discussed later in this chapter. Among the assump- 
tions is one that learning can be described by the special type of 

helpful elementar y introduction to these notions is contained in Bross 

{ 1953), 

18 E.g., Mosteller and Nogee ( 1951). 

«E.g.,Airow (1951b). 
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stochastic process known as a Markov process, in which only the imme- 
diately preceding event is used in making predictions about the one that 
follows it. The author admits that this assumption will probably not 
prove adequate. 17 

There is likely to be a greater interplay between the theory-of-games 
model and psychology within the next few years, but it is already quite 
clear that the model will have to undergo some modifications to move 
from the assumption of rational behavior to more appropriate assump- 
tions based on the actual behavior of individuals. This need is widely 
recognized. For example, Estes in reporting individual behavior in the 
face of uncertainty concludes that “in a simple decision process the 
human subject tends to behave in accordance with the principles of as- 
sociative learning and not, in general, in the most rational manner as 
rational* is conventionally defined.” 18 Similarly, Hoffman, Festinger, 
and Lawrence (1954), in a study of coalitions in a competitive situation, 
are able to specify motivational changes affecting the coalitions in a 
manner to deny the symmetrical utility function required by game 
theory. Davis has taken the position that their situation is gamelike, but 
probably better treated as general psychology than as game-theory. 19 
Davis’ position is little likely to lead to a unified theory of social behavior. 

These brief introductions to feedback models, information-theory 
models, and theory of games, suffice as a background for the model- 
building activity of contemporary learning theories. There is a family 
resemblance among all these models. They all deal with sequential be- 
havior, they all deal with uncertainty and decision-making. While these 
three models were developed in contexts other than psychology, it is 
clear that they are gradually influencing contemporary psychological 
theorizing. 

MATHEMATICAL MODELS IN CONTEMPORARY 
LEARNING THEORY 

We have already seen, in a preliminary way, how mathematics has 
been used in relation to learning data. Empirical curve-fitting uses a 
loosely defined model: merely a family of curves selected according to 
convenience and without theoretical justification. When some new data 
are better fitted by a curve from a different family, the old model is 

17 Markov processes were first mentioned in the interpretation of learning by Miller 
and Frick ( 1949), and they are assumed in most of the later mathematical treatments. 

18 Estes (1954a), page 136. 

19 Davis ( 1954), page 15. 
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abandoned in favor of a new one. Rational curve-fitting corresponds more 
closely to contemporary model-building, for the nature of the curve is 
fixed by the theory, and predictions follow. 

Models Based Largely upon Differential Calculus 

The success of calculus in handling so many problems of physics 
and engineering has made it the standard fare of most college mathe- 
matical sequences. Because calculus deals with rates of change, it appears 
to be an appropriate mathematics to try out in relation to problems of 
learning. 

Models depending largely upon differential calculus have been 
extensively used by Rashevsky and his colleagues and students in studies 
over many years. While the models are usually referred to the nervous 
system, little of neuroanatomical detail is involved, so that the model is, 
in fact, more largely mathematical than anatomical or physiological. 
Despite the ingenuity displayed, and the mathematical sophistication, 
these models have had relatively little influence within psychology. 
Their neglect must be due only in part to the inability of psychologists 
to follow the mathematics, for other difficult mathematical models have 
been understood by at least a few psychologists. Some of the neglect may 
have been due to the lack of interest in neurophysiological speculation by 
learning theorists. Possibly more important is the social fact that no one 
has attempted a consistent experimental program tied to the systematic 
developments. It is not enough that the mathematical consequences satis- 
factorily fit old data. For psychologists to become interested, new and 
relevant data must be predicted. The failure of anyone to pick up the 
threads of the Rashevsky theories may be merely a kind of historical 
accident, for the possibilities appear to be there. 20 

We saw in Equations (7) and (8), page 373, how Thurstone be- 
gan by setting up differential equations and then through integration 
arrived at his learning-curve equation. Many models begin in this way. 
To illustrate the naturalness of this approach to the construction of a 
mathematical model, we may consider a very sketchy model recently 
proposed by Simon (1954). He called it a “Berlitz model,* after the 
self-instruction method of learning a foreign language. He makes the 
following assumptions about a person starting out to learn French by this 
method: 

20 For a review making the Rashevsky models somewhat more accessible, see Cole- 
man (1954). These models deal with social behavior, rather than with learning, but 
the review reveals the type of logic employed. 
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1. The individual is free to practice as much or as little as he wishes. 

2. As he practices, the activity becomes easier for him. 

3. The pleasantness of the activity increases as it becomes easier for him. 

4. The more pleasant the activity, the more he engages in it; the less pleas- 
ant the activity, the less he engages in it. 

It takes very few additional assumptions to express these verbal 
statements in a mathematical model. 

First, the rate according to which difficulty decreases as practice 
continues can be expressed by the following differential equation; 


dP 

dt 


—aPx 


(ID 


where D = level of difficulty at any one time 
x = rate at which practice takes place 
a = a constant based on the learner's capacity to learn 
t = elapsed time 


This means, in words, that the rate of change in difficulty decreases 
as the product of the difficulty level at the time and the rate of practice 
at the time. With this equation as the starting point, we will come out 
with a logarithmic decrease of difficulty with practice when the differ- 
ential is integrated. Thus the statement that practice reduces difficulty 
has become more precise as we use a mathematical model instead of the 
verbal one; whether or not it is true remains a matter for empirical 
testing. 

A second differential equation expresses the motivational assumption 
that the task becomes more pleasant with practice, and that, as it becomes 
more pleasant, there will be a tendency to practice more: 


f t — b ^ 


( 12 ) 


where x = rate at which practice takes place 

5F = satiation level of practice for any level of difficulty 
Less practice is pleasant, more practice is unpleasant 
— b = a constant based on the learner's motivation 
t = elapsed time 


Put into words, this says that the amount of practice will decrease 
proportionately to the amount that the rate of practice exceeds the 
satiation level. 

Even as simple a model as this permits predictions of time paths, of 
practice and difficulty, provided the original values of difficulty (D 0 ), 
the initial practice rate (x 0 ), and the satiation level (x) are known. 
Here are examples of some common-sense predictions based on the 
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model. These predictions follow strictly, however, from the mathematical 
implications of the model: 

a . If the initial values of D 0 and x 0 are very high, it can be predicted 
that the decrease in x will be more rapid than the decrease in D, so that 
the student will give up his practice before learning French. 



FIGURE 57. Some implications of Simon's “Berlitz” model. Simon (1954), page 
403. 


b. With intermediate values of D 0 and x 0 there will be temporary 
discouragement and slowing up of practice, because practice of difficult 
material is unpleasant, and unpleasantness slows practice. The mathe- 
matical model shows, however, that there is a critical point, and if diffi- 
culty falls below this point before practice stops, practice will become 
pleasant and learning will proceed to mastery. 

c. If difficulty is low, and initial rate of practice also low, learning, 
according to this model, will proceed smoothly. 

These three conditions are illustrated in the curves of Figure 57. 

Even the student entirely unfamiliar with calculus will perceive 
a certain naturalness in the attempt to fit this model to a learning situa- 
tion. The model is proposed by Simon for purposes of illustration, and he 
does not think of it as sufficiently developed to be used to design em- 
pirical research. 

Stochastic Models 

Psychologists have long been familiar with probability concepts as 
underlying psychological measurements. They have also recognized 
the possibility of extending probability concepts to sequential data of 
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the sort found in learning. Thus Thurstone (1930a), with a background 
in psychophysical measurement, found it appropriate to use probability 
concepts in developing his rational curve equation, although the mathe- 
matical procedures used were those of differential and integral calcu- 
lus. 

The kind of model based on probability mathematics rather than 
upon differential calculus 21 has come to be called a stochastic model. 
A stochastic process is defined as a sequence of events together with the 
probabilities of these sequences. Thus for the older notion of a casual 
chain we now have the related concept of stochastic process — a chain 
with uncertainty at every link. These uncertain but possible events at 
each link are expressed according to the mathematics of probability. 
Perhaps the Tolman and Brunswik (1935) expression “causal texture” 
can be appropriately substituted for “causal chain” when we deal with 
stochastic processes. 

We shall turn now to two illustrations of stochastic models within 
learning theory. 

Estes’ Statistical-Association Model 

Estes (1950) presented a statistical model with psychological as- 
sumptions very similar to those of Guthrie. As general background theory 
he assumed that the stimulus complex consists of a “finite population of 
relatively independent environmental events.” Only a sample from this 
population is effective at any given time. Responses, too, fall into classes, 
and response instances occur and enter into experiments as the dependent 
variables. To develop a probability model, response probability has to be 
defined. Estes defined response probability as the average frequency of 
occurrence of members of a response class relative to the maximum 
possible frequency under the conditions of the experiment. The basic 
learning assumption is that association occurs through contiguity. That 
is, if a stimulus element is present once along with a response occurrence, 
the association is formed at full strength through this single occurrence. 
Response probability for the class of responses being learned increases 
with practice because more stimulus elements become conditioned to it 
on succeeding trials, and more elements become alienated from their 
association with other response classes. This is, of course, Guthries 
theory, expressed by Estes in mathematical form. These psychological 

21 This does not mean that calculus is not used in the course of developing these 
models, for you do not avoid calculus in probability theory any more than you avoid 
algebra in calculus. The main point is that calculus deals with regularities, while 
probability deals with measures that contain some degree of uncertainty. 
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assumptions suffice to permit equations to be written, predictions made, 
and agreements or disagreements with data determined. 

The theory first proposed by Estes (1950) has been refined mathe- 
matically and applied to new problems by Estes and Burke (1953) and 
Estes (1953) (1954a). 

Let us consider, first of all some data to which the developed model 
is appropriate. The experiment is one by Grant, Hake, and Hornseth 
(1951), extending a kind of experiment introduced by Humphreys 
(1939b). 

Two lights are mounted on a board before the subject. The subject 
is asked to guess, while the left light is on, whether or not the second light 
will flash on. The guess is recorded, and the subject notes whether or 
not his guess was correct. The experiment was so arranged that for 
different groups of subjects the percentages of second flashes varied. 
The percentages of trials on which the second light flashed on were 
0, 25, 50, 75, and 100 per cent. Results are plotted in Figure 58. Note 
that after 60 trials subjects tended to be guessing at about the percent- 
age at which the lights were flashing. 

Although these results appear plausible enough, a little reflection 
will show that they are not the results that would be given by a “rational” 
decision-maker, as implied, say, in game-theory. Consider the cases in 
which lights come on 75 per cent of the time and are off 25 per cent of 
the time. Any gambler knows that if the odds are 3:1 that one event 
will occur rather than another, once he detects these odds he will maxi- 
mize his hits by always betting on the more frequent event. Yet the 
subjects somehow “preferred” to match the stimulus probabilities with 
their guesses rather than to maximize their hits. Many similar experi- 
ments have yielded similar results. 22 

We need to examine a little more carefully the sense in which 
Estes’ theory is a contiguity theory similar to Guthrie’s. The events in 
this light-guessing experiment are conceived by Estes to parallel classical 
conditioning as follows: 

1. Conditioned stimulus = signal light (left light). 

2. (Conditioned response = saying “on” or “off' in anticipation of what is 
going to happen on the right. ) 

22 The reader must be warned that these are experiments in guessing without risk. 
If there are penalties for errors, the described experimental results are no longer found. 
When Humphreys* (1939a) subjects received a puff of air to the eye following 50 
per cent of the conditioned stimuli, they approached the same asymptote for their 
conditioned eye-blinks (and at the same rate) as though they received the air-puff 
100 per cent of the time. The over-reaction to the 50 per cent reinforcement presumably 
depended upon the threat involved in not responding. 
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3. Unconditioned stimulus = right light “on” or “off*” 

4. Unconditioned response = saying “on” of “off” according to what hap- 
pens on the right. 

In a contiguity theory the conditioned response is unimportant ex- 
cept as an indicator of the state of the learned-response system, and as 
furnishing some of the components of the stimulus situation present 
when the unconditioned stimulus and response occur. The conditioned 



FIGURE 58. Positive responses when a first light is followed by a second light 
in specified percentages of the trials. Note that the limit approached tends to corre- 
spond to the percentage of times that the light appears. The trials are numbered in 
blocks of five. After Grant, Hake, and Homseth ( 1951 ). 

response as such is not important in determining the terminal events that 
are responsible for what is learned. The contiguous items in Estes' 
system are the conditioned stimulus, the unconditioned stimulus, and the 
unconditioned response. The conditioned response (the “guess” as to 
whether or not the light is coming on, or which light in some cases) does 
not enter the equations as a modifier of the influence of the unconditioned 
stimulus. Thus there is no “confirmation” of a correct guess and no “cor- 
rection* of an erroneous one. All that really matters is the occurrence 
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or non-occurrence of a light and the verbal response to it in the presence 
of cues (the effective conditioned stimulus). Thus the interpretation is 
strictly a contiguous association one, without reinforcement or extinc- 
tion of a prior response, as in reinforcement-type theories. 

Now to return to Estes and Burke. We shall see something of the 
construction of their theory in relation to the experiment previously 
described. In this we follow Estes (1953). 

Let Ei be the case when the second light comes on and E 2 the case 
when the second light fails to come on. Let p^n) be the probability 
that Ej will be guessed on trial n, and p t (n + 1) the probability that 
E 1 will be guessed on trial ( n + 1 ) . The student unfamiliar with proba- 
bility notation will have to be on his guard not to interpret all paren- 
theses as signifying multiplication, as they commonly do in algebra. The 
parentheses after p 19 here do not imply multiplication. Unfortunately, 
mathematicians expect you to know their conventions, which are no 
more consistent than the conventions of orthographers in other languages. 

The theory assumes that every time the right light comes on and is 
responded to by “on” (i.e., whenever the unconditioned stimulus and 
unconditioned response occur) there will be an increase in the proba- 
bility of responding to the signal (the conditioned stimulus) by guessing 
“on” (the conditioned response). That is, if E x occurs on trial (n) the 
increase in probability of its being guessed on trial (n + 1) can be 
stated according to the equation: 

Pi(n + 1) - p t (n) = 0[1 - p x (n)] (13) 

where 0 is a fraction between zero and one. 

We see from Equation ( 13 ) that the increase in probability per trial 
is a constant fraction of the amount remaining to be learned, 1 — p 1 (n) 
being the difference between perfect responding ( at a probability of 1 ) 
and the present probability, p,(n). This will obviously yield the same 
form of learning curve as that adopted by Hull (see page 372). 

Now if E 2 (absence of light) occurs, the probability of predicting 
E x diminishes because by association theory it is the response to E 2 
that should be favored on the next trial. This diminution of probability 
is expressed by the following equation: 

p,(n + 1) - p x (n) = - 6p x (n ) (14) 

These two equations are known as operator or difference equations . 
They merely state in precise form the assumption that probabilities of 
repeating one of a pair of alternative responses increase with positive 
cases and decrease with negative cases. The constant 6 has to be deter- 
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mined empirically, but once determined it should hold through many 
variations of the experiment. 

The fraction of trials that the light comes on (when E 2 occurs) can 
be expressed as * (Greek letter pi), and the trials on which it is omitted 
(when E 2 occurs) as (1 — «*). We know what happens when E x occurs 
from Equation (13) and what happens when E 2 occurs from Equation 
(14). Because we know the relative frequencies of E x and E 2 , we can 
combine Equations (13) and (14) directly: 

P x (n+ 1) -p x (n) = »(0[1 — p 1 (n)]) 4* (1 — w )[-0p x (n)] (15) 

This equation looks difficult to the mathematical novice because of 
the parentheses and brackets, a subscript, some Greek letters, and un- 
certainty as to which parentheses are really subscripts and which mean 
multiplication. Anyone with high school algebra can carry out the 
multiplication and combine terms. When this is done, the following 
equation results: 

P x (n+ 1) — p x (n) = 0 [t r-p x (n)] (16) 

Comparing Equation (16) with Equation (13), we see that w has 
replaced 1. In other words, the limit of learning has changed from 1 to tt. 

Equation (16) gives us the expected change of p x from any one 
trial to the next. Now we skip some mathematical steps leading from 
Equation (16) to a derived learning-curve equation. It can be shown 
rigorously that beginning with trial 0 and going to trial n, we have as the 
probability of guessing E x on any trial n: 

Pi(tt) = *■-[»- P^O) ](1 - 6) n (17) 

This is a derived learning-curve equation, and in form it looks very 
much like many others in the literature, for example, that of Hull, 
Equation 4, page 372. But the constants have a special meaning in this 
case. Just as Hull’s curve approaches M as a limit, so this curve ap- 
proaches 7r as a limit. This follows because the second term in Equation 
(17) becomes negligible for very large values of n. But tt is the relative 
frequency with which E 3 occurs. 

Now we come back to our experimental findings. The mathematical 
theory has predicted exactly the result found by Grant, Hake, and Hom- 
seth (1951), and other experimenters, for the asymptote of the guesses 
in these experiments tends to be ir, the relative frequency with which 
the second light comes on. 

While this is a most incomplete account of Estes’ theory, it serves at 
least as an introduction to it 
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Bush and Mosteller s Linear-Operator Model 

Bush and Mosteller (1951a) were the first to propose a linear- 
operator model for learning theory, although such operators were later 
incorporated into the Estes and Burke model just described. We shall 
see presently how mathematically similar the two models are, but first 
we may examine the similarities and differences in psychological as- 
sumptions. 

Bush and Mosteller make essentially the same assumptions as Estes 
with respect to stimulus and response. Their assumptions are stated 
with particular clarity in their second paper. 23 In this paper they use 
the notation of "set theory” in mathematics. For both Estes and Bush 
and Mosteller the organism perceives (or reacts to) some fraction of 
the total stimuli available. Whereas Estes describes this fraction in 
terms of the proportion of elements in a totality, Bush and Mosteller 
speak of the "measure” of the elements. The "measure” is a kind of sum 
of weighted elements, and serves the same purpose as Estes’ number of 
elements. The set-theoretic model in this second paper is used to derive 
the linear operators of the earlier paper. Because die psychological as- 
sumptions are unchanged in this more elegant mathematical treatment, 
we return now to the original psychological assumptions. 

The basic difference between the psychological assumptions of 
Bush and Mosteller and of Estes rests on interpretation of reinforcement. 
Estes follows a contiguous association theory, not requiring the opera- 
tion of reinforcement or of extinction as fundamental principles. Bush 
and Mosteller adopt principles of learning more like those of Hull and 
Skinner, accepting one parameter (a) for the positive influence of rein- 
forcement, and another parameter (b) for the negative influence of 
punishment or of work required in responding, that is, a work inter- 
pretation of extinction. 

Their basic operator, in their notation, is: 


Qj> = P + a(l -p) - bp (18) 

where Q p = the operator Q operating on p, the probability that a response 
will occur during a specified time 
a = parameter associated with reward 
b = parameter associated with punishment or work required 
p = probability of response 


This is the linear operator that gives the model its name. 24 
23 Bush and Mosteller ( 1951b). 

- 4 We may rewrite Equation ( 18) in a form to show more clearly that it is a linear 

- ? - b P + * Because ( 1 -a - b) and (a) constants, Z 
equation is m typical linear form, e.g., y = ax -f b. 
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We may define the isolated effect of reinforcement by setting b — 0 
in Equation (18): 

Q, = p + fl(l-p) (19) 

1 

Similarly, the isolated effect of extinction can be studied by setting 
a = 0 in Equation ( 18 ) : 

Q p = p — bp (20) 


Now we have two equations from Estes, Equations (13) and (14) 
and two from Bush and Mosteller, Equations (19) and (20). How are 
these related? In the case of flashing lights we may assume that the 
effect of saying “on” to a light that has appeared will affect the next 
performance in an amount equal to the effect of saying off to a light 
that has not come on (although, of course, the effects are in opposite 
directions ) . That is, we may set the parameters all equal, a = b = 0. 
Then we can show that the Estes and the Bush and Mosteller equations 
are equivalent. They may be written in parallel as follows: 


p ± (n + 1) = Pi(n) 4- 6[l - p t (n)] 

Q p = p + 0[1 — pi 

i 

p x (n + 1) =Pi(«) ~6p 1 (n) 

Q P = V -*? 


From Equation (13) 
From Equation (19) 

From Equation (14)' 
From Equation (20) 


The only remaining differences are in notation. 

This coincidence of two models based on quite different psycho- 


logical assumptions is significant, but the case illustrated here is a 
special one, without risk, with the assumption of equal consequences 
of alternative occurrences, and without any assumption of work involved 
in responding. When these several assumptions are not met, the two 
models may yield different predictions. 

Let us now look at a single illustration of the Bush and Mosteller 
model in use. Consider the course of learning in a Skinner box. One 
measure of learning is the total number of responses to extinction follow- 
ing various numbers of reinforcements. Data are available from the 
work of Williams (1938) for responses after 5, 10, 30, and 90 reinforce- 
ments. T hes e are plotted in Figure 59. The smooth curve is the one pre- 
dicted from the Bush and Mosteller model, but we need to retrace the 


steps to see how it was arrived at. 

Beginning with the operator in Equation (18), the change of rate 
of responding with time can be computed on the assumption that, start- 
ing with any probability p, the rate will increase according to this proba- 
bility changed by the operator. According to the mathematics of differ- 
ential and integral calculus, appropriate to changes of rate in a free- 
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response situation of this kind, an expression can be derived for the 
maximum rate of responding. 25 This turns out to be: 

Vtaa * = l + b/a (21) 

where V max = rate of responding with infinite time 

i o = maximum possible rate of responding when p = 1; "activity 
level” 

b/a = ratio of the constants from the operator 

What does this maximum mean? The final rate of responding in- 
creases with a , hence with the amount of reinforcement, and decreases 
with b, the amount of work required in responding (or other negative 
influences). Under most circumstances the maximum possible rate (o>) 



FICTJRE 59. Total extinction responses as a function of number of reinforcements. 
Data from Williams (1938). Smooth curve as derived from Bush and Mosteller’s 
learning model. After Bush and Mosteller (1951a), page 320. 

will not be approached, even with long-continued practice. For example, 
when a = b, the maximum rate will be V raax = 0.5a>. This formulation 
is missing from the Estes-Burke model, which does not take into account 
the work required in responding. 

By similar mathematical processes it is possible to derive an equa- 
tion for the total number of responses to some criterion of extinction, 

25 Details of the derivations here summarized very sketchily are found in Bush and 
Mosteller (1951a), pages 817-319. 
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provided only the rate of responding at the beginning of extinction (V e ) 
is known. This turns out: 


1 , V c 

mT = ¥ l0 Sv7 (22) 

where m T = total responses to a criterion of extinction 
b = constant from the operator 
V e = rate of response at beginning of extinction 
V/ = final rate at “end” or extinction (i.e., criterion for empirically 
“completed” extinction) 

Equation (22) fits the data of Skinner-type extinction curves. But 
we wish to go one step further in fitting the Williams data of Figure 59. 
We must express the total number of extinction responses as a function 
of n, the number of reinforcements. 

The original operator, if repeatedly applied, produces a curve of 
acquisition very similar to that of Estes and Burke in Equation (16). 
In the notation of Bush and Mosteller we have: 

= (iTi) ” (tT6 -?)<!-«-&)• (23) 

where Q n p = probability of response after n reinforcements. 
a, b = the constants from the operator 
n = number of reinforcements 

p = probability of response at the beginning of reinforcement 

A transformation involving some additional steps that need not con- 
cern us rewrites Equation (23) in terms of rates of response at the be- 
ginning (V 0 ), when extinction is undertaken (V e ), and at the maximum 
(V max ): 


V. = V raax - (V max - V 0 ) (i - a - b)* (24) 

Note that to produce identity between the Estes Equation (17) and 
the Bush and Mosteller Equation (24) we need only make the follow- 
ing assumptions: 

Vjnax := at 

a—B 

b = 0 (for Estes does not consider the work factor) 

We return now to the problem of fitting the Williams data. The 
main relationships are contained in Equations (22) and (24). The 
second of these relates the rate of responding at the time extinction is 
begun to the number of prior reinforcements, while the first predicts how 
many extinction responses will follow this level of responding. All we 
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need to do is substitute Equation (24) into Equation (22) and we 
have our required equation. We have: 

( 25 ) 

Inspection of the Williams data led Bush and Mosteller to set 
V max /V/ = 5, and V 0 = V/. After this gross empirical step, the constants 
a and b were determined by curve-fitting, yielding a = .014 and b = .026. 
Substituting these values, Equation (25) reduces to the following: 

™ T= :di6 lo g{ 5 ~ [4]( - 96)n } (26) 

It is now clear that we have an equation in which m T , the total 
responses to extinction, can be computed by substituting a value for n , 
the number of reinforcements. This is the computation that produced 
the points from which the smooth curve of Figure 59 was constructed. 
The fit, as we see by inspection, is satisfactory. 

This one specimen can serve as the barest introduction to the Bush 
and Mosteller models, which have recently been published in book 
form. 26 

Relative Success of Stochastic Models 

The models of Estes, Estes and Burke, and of Bush and Mosteller have 
been quite successful in predicting the course of events within lever- 
pressing problems, simple runway problems, discrimination, and reactions 
to uncertainty. Data have been derived from both animal and human 
experimentation. 

Because of the common choice of an exponential function (essentially 
.of the form Hull found so convenient in his extensive curve-fitting), 
theories derived from a variety of assumptions are all rather successful 
in fitting empirical data. Hence success in curve-fitting does not permit 
choice from among the theories. A more penetrating question to be asked 
of them is whether or not the constants that show up in their equations 
are consistent. The real test of a * rational” equation begins with the inter- 
changeability of the constants from one situation to another. (If con- 
stants vary systematically, the varied constants must also be interchange- 
able.) A further test (for which contemporary theories are not yet ready) 
is to provide independent justification and measurement of the constants. 
For example, the speed of light shows up as a constant in many physical 

26 Bush and Mosteller (1955). The book had not yet appeared when this chapter 
was written. 
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equations. But the speed of light can be measured, and thus can give 
independent justification for identifying the constant with the speed of 
light. If Estes’ constant 0 represents some fraction of the total stimulus 
elements effective at once, there ought to be some way of demonstrating 
this directly. Similarly, Bush and Mostellers constants a and b ought 
to have independent justification through quantitative studies of motiva- 
tion and work, independent of learning. But this more searching analysis 
of the theories can wait until the success with interchangeable constants 
is demonstrated. 

Mosteller (1953) has presented a table showing the value of Estes 
and Burke s 6 as derived from a number of symmetrical two-choice 
situations, resembling the experiment of Grant, Hake, and Hornseth 
previously described, but also including some experiments arranged as 
a gambling set-up, in which the subject had to pay to play, and thus had 
additional motivation for desiring to win. Mosteller s table is reproduced 
as Table 6. 

Two things need to be said about this table: 

1. Of 22 determinations with rats, high school students, and college 
students, under a variety of conditions including many pay-off fre- 
quencies, the value of d is either .03 or .04 in 14 cases, and only four cases 
lie outside the range of .02 to .05. This is rather interesting agreement, 
considering that all the data are fitted in accordance with a single param- 
eter. 

A word of caution is in order. At the asymptote ir (or V raax ) the 
term that includes 6 (or a and b in the other system) approaches zero, 
so that the prediction of the value of the asymptote is independent of 0 , 
The value of $ specifies the rate at which the asymptote is approached, 
and a slight change in 0 may make a great change in the number of 
trials required to approach the asymptote. For example, let us set the 
quantity (1 — 0) n = .Ol. Then by substituting values of 6 and solving 
for n we can see what difference 8 makes in the number of trials required 
to approach to within a standard distance of the asymptote. By this cal- 
culation the range of trials required to approach the asymptote with a 6 
of .05 is 90 trials, whereas with a 0 of .02 the corresponding value is 
228 trials. Hence very small changes in 6 may produce significant differ- 
ences in the number of trials required to reach some criterion of 
mastery. 27 

27 Dr. Douglas H. Lawrence, who called my attention to the importance of small 
differences in 0, also points out that 0 varies more widely if the usual method of least 
squares is abandoned in favor of the method of maximum likelihood in fitting curves 
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TABLE 6. Value of Estes* 0 Computed from a Number of Symmetric Two-Choice 
Situations. After Mosteller, 1953, page 42. 


Pay-off Frequencies 

Experimenter , Subjects, Conditions 

Value of 6 

100:0 

Stanley (rats in T-maze) 

Jarrett ( Harvard students ) 

.04 


— pay to play 
— free play 

.11 

.05 

50:0 

Stanley (rats in T-maze) 

Jarrett (Harvard students) 

.04 


— pay to play 
— tree play 

.04 

.02 


Robillard (students) 



— rewards of 0-cents 

.04 


— rewards of 1-cent 

.03 


— rewards of 5-cents 

.03 


Cards ( high school students ) 

.04 


( Harvard students ) 

.07 

75:25 

Stanley (rats in T-maze) 

Jarrett ( Harvard students ) 

.04 


— pay to play 

.03 


Estes ( students ) 

.04 


Jarvik (students) 

.05 

100:50 

Jarrett ( Harvard students ) 



— pay to play 

.07 

80:0 

Robillard (students) 



— no rewards 

.06 

80:40 

Robillard (students) 



— no rewards 

.03 

67:33 

Jarvik (students) 

.03 

60:40 

Jarvik (students) 

.02 

60:30 

Robillard (students) 



— no rewards 

.03 

30:0 

Robillard (students) 



— no rewards 

.03 


2. The four cases in which the parameter disagrees strikingly with 
the rest are psychologically interesting, and pose important problems 
for theory. 

For example, in the 100:0 problem it makes a real difference whether 
or not the Harvard students had to pay one cent to operate a slot machine 
in order to win one cent. Also the fact that 100:50 has consequences 

to empirical data. The method of maximum likelihood is more appropriate than the 
method of least squares because it weights more heavily the approach to the asymptote. 
The approach to the asymptote best tests the constancy of 0. 




the emergence of MATHEMATICAL MODELS 401 

different from 50:0 is psychologically interesting. From a probability 
point of view these are alike (provided we weight positive and negative 
cases equally): certainty on one side and a 50:50 chance on the other. 
Yet they are unlike in the pattern of responses they call out from the 
subject. These discordant results show that some supplementary as- 
sumptions are needed before the theories can be considered completely 
general. 

The models described are, of course, unfinished ones. The original 
model-builders are carrying on with them, and others have come along 
with new suggestions. For example, Restle (1955) has presented a 
variation of the model in which 8 is interpreted as the proportion of 
relevant cues. 28 This turns out to be a simplifying assumption, so far 
as discrimination is concerned, and leads to some satisfactory predic- 
tions. Restle’s work comes nearer to providing an independent measure 
of what is essentially Estes’ 8 , and shows circumstances under which 8 
values change. 


THE PLACE OF MATHEMATICS IN LEARNING 

With the advance of psychology as a science there will inevitably 
be an increased use of mathematics. Mathematics is the language of 
measurement, of precision, of prediction. Yet there is no royal road to 
scientific achievement and mathematics must remain subservient to 
observation and reflection. Instruments of precision in the laboratory are 
also the means of scientific progress, but we all know that in the wrong 
hands they may lead to narrowness and triviality instead of to advance. 
Therefore we need to examine mathematics critically as a powerful tool, 
but not a magical one. 

The Improvement of Data 

The relationship between theory and data is circular: without 
theories we do not know what data to look for, and without good data 
we cannot test our theories. 

We have been greatly helped in the improvement of our data through 
that branch of applied mathematics known as statistics. We have learned 
how to test the significance of differences, how to express reliability 
and validity of scores, how to use correlation as a measure of relation- 
ship, and factor analysis to comprehend a table of intercorrelations. We 

28 The concept of relevant and irrelevant cues is discussed by Lawrence ( 1950 ), with 
experimental justification for the distinction. 
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have gone ahead to learn more about the design of experiments so that 
results can be secured both economically and in a form suitable for 
statistical treatment. Statisticians have helped us also in the construction 
of scales by which data can be ordered, and we have learned to dis- 
tinguish between interval scales and ratio scales. 

These services of statistics are reflected throughout the experimental 
study of learning. Because through the better application of statistical 
methods the yield of data is better, the learning theorist is helped. He 
needs satisfactory data if he is to test with any precision the predictions 
from his theories. 

The Improvement of Theory 

Theory has often had a rather bad name, being associated with 
"idle speculation.” Modem theory, closely in contact with data, is es- 
sential to the advance of psychology as a science. Just as statistics helps 
the experimenter to go about his business, so the contemporary mathe- 
matical models help the theorist who wishes to bring order out of exist- 
ing data and make it possible for him to suggest new experiments that 
need doing. 

The advantages of models have been summarized by Estes (1953) 
as follows: 

1. Information incorporated in a model is in a form suitable for pre- 
diction. 

2. The results of different experiments often become more nearly 
comparable when referred to the same formal model. 

3. The quantitative analysis of data may be facilitated by the model. 
Hidden or confounded variables can often be unraveled if the mathe- 
matical model is a good one. 

4. Because of the rigor engendered by the model, theoretical as- 
sumptions can be given more precise test than without the model. 

Mosteller (1953), in commenting on Estes' paper, goes further into 
the advantages of mathematical models over ordinary curve-fitting, He 
points out that with a good model a great many questions can be an- 
swered on the basis of a very few numbers (perhaps as few as two or 
three computed parameters). Some types of questions that he says might 
be answered are: 

1- What is the average trial number of the last failure? 

2. What is the average length of runs of failures? 

3. What is the standard deviation of the trial number on which the 
tenth success is achieved? 
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4. What is the probability that a man who has failed on trials 7, 8, 
and 10 will succeed on trial 14? 

He points out that routine curve-fitting would require new calcula- 
tions direct from data in order to answer each of these questions. He 
concludes: "Thus, in a reasonable sense the few constants plus the model 
can contain nearly all the useful information in the original data.” 

The purpose of the model is clear enough. It provides a formal 
theoretical structure into which data can be fitted, and it permits the 
statement of testable scientific propositions about the relationships 
among the data. 

Some Cautions 

The fascination and promise of mathematical theories are so great 
that a word or two of caution may be appropriate. That is, we need to 
keep all developments in proper perspective. It is not a question of being 
for or against the development of mathematical models. They should 
most certainly be developed, but in their appropriate relationship to 
other aspects of science. The following statements call attention to some 
of the considerations that ought to be kept in mind as mathematical 
theories are evaluated. 

1. The history of science tells us that some of the most important 
developments in the relation of mathematics to science came about with 
very simple mathematics. Thus chemistry and atomic theory generally 
were enormously advanced by the law of combining proportions (Gay- 
Lussac’s law), which stated simply that the combining volumes of gases 
entering into compounds could be stated as simple whole numbers. 
Thus the ratio of hydrogen and oxygen combining to form water is 2:1. 
What mathematics could be simpler? Another advance in atomic theory 
was Mendeleev’s periodic arrangement of the elements. He noted a 
kind of periodicity in which, with elements arranged in order of in- 
creasing atomic weights, certain elements resembled earlier ones in 
valence and other properties. Again, the simplest of mathematics, with 
the most profound results. Or, to choose another field, consider Mendel’s 
famous ratio of 3:1 for the types showing up after a given kind of mat- 
ing. Here is the background for the modern science of genetics. What 
these examples suggest is that scientific advance comes when the right 
concepts are discovered. The right concepts may prove to be beauti- 
fully simple and capable of very simple mathematical expression. The 
complicated mathematics becomes more useful later, after work goes 
on within the right concept system. 
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This is not, of course, a complete history of science, and sometimes it 
has required real mathematical genius to bring scientific advance. Had 
Kepler not been a skilled mathematician he could not have brought order 
out of the very complex mass of data left by Tycho Brahe. When by a 
series of curve-fitting attempts he found that he could fit the path of 
Mars with an elipse, his * model” created a kind of order until the more 
satisfactory “model” of Newton succeeded his. The fact remains, how* 
ever, that the search for the most appropriate concepts by which to 
harmonize scientific data often may not require complex mathematics. 

2. The verification of a scientific theory is a slow process, and a 
few successes do not suffice to assure the enduring value of the theory. 
As we have seen, theories starting out with different assumptions may ac- 
count for some of the same data. As long as competing hypotheses are in 
the running, further tests are needed to permit choice among them. A 
really poor model gets rejected early, but a plausible model may survive 
for some time before it is discarded in favor of a better one. 

3. A model is appropriate to a limited range of phenomena at most. 
This fact is ignored in two ways. The first way in which the limitations 
are ignored is by the model-builder who is tempted to generalize the 
applicability of his model beyond the data to which it is appropriate. 
This is all right, provided he is as cautious in his extrapolations as he 
is in his model-building, but the temptation is great to argue by analogy 
without testing the analogy as carefully as it needrto be tested. The 
second way in which these limitations are ignored is by the critic who 
brings in as a refutation of the model some data that the model does 
not propose to handle. 29 

4. Models starting with unlike assumptions may be complemen- 
tary rather than competitive. A model such as that of Pitts ( 1943), which 
follows the Rashevsky pattern in making assumptions about the nervous 
system as the basis for a model of conditioning and learning, may comple- 
ment a model such as that of Bush and Hosteller that is concerned with 
events going on at the level of learning data. In other words, it may 
turn out that more than one model is “true,” in the sense that it fairly 
represents reality when that reality is abstracted in a certain way. 

5. Mathematical models are not foolproof, even when the mathe- 
matics used is elegant and accurate. Mathematical arguments have to be 
scrutinized for fallacies just as verbal arguments do. It is usually easier 
to detect the error in mathematics than in verbal statement, but not 
always. The mathematical system is self-contained, but its appropriate- 

* 9 This point has been made by Spence ( 1952b). 
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ness for a given set of data always involves assumptions, some of which 
may be far from obvious. For example, more than one process may be 
symbolized by a common parameter. We have seen how die parameter b 
in the Bush and Mosteller model stands for inhibitory processes such as 
punishment and work. But this is a large order! Whenever b appears in 
an equation it is appropriately multiplied or divided or squared, but if 
punishment and work are not processes that act alike, this parameter 
does not fully represent them. Estes’ model is supposed to get along 
without assumptions about reinforcement or extinction, yet the operators 
distinguish between success and failure in a manner that looks suspi- 
ciously as though some kind of “feedback” is implied. The mathematical 
model is not going to save us from theoretical arguments, although it 
may perhaps save us from foolish ones. 

6. Because mathematical models are useful ones, it does not follow 
that they will pre-empt the field, and that no other kind of theory will 
be worth listening to. Guthrie’s theory, which provides the background of 
assumptions for Estes, was derived without benefit of mathematics. 
Freud’s various theories, possibly the most influential of all in the psy- 
chology of the twentieth century, were developed by observations un- 
aided with either instruments or mathematics. While mathematical 
models deserve prestige, they need not lead to the neglect of other kinds 
of theorizing. 


SUPPLEMENTARY READINGS 


BOOKS 

Relevant mathematical models from non-psychological fields 

Arrow, K. J. (1951) Social choice and individual values. 

Bross, I. D. J. (1953) Design for decision . 

Morse, P., and Kimball, G. (1951) Methods in operations research . 

Rashevsky, N. (1938) Mathematical biophysics. 

Shannon, C. E., and Weaver, W. (1949) The mathematical theory of communi- 
cation. 

Truxal, J. G. (1955) Automatic feedback system synthesis. 
von Neumann, J., and Morgenstern, O. (1947) Theory of games and economic 
behavior. 

Wiener, N. (1948) Cybernetics . 

Mathematical models in psychology and learning theory 

Bush, R. R., and Mosteller, F. (1955) Stochastic models for learning . 
Householder, A. S., and Landahl, H. D. (1945) Mathematical biophysics of 
the central nervous system. 

Hull, C. L., and others (1940) Mathematico-deductive theory of rote learning. 



406 THEORIES OF LEARNING 

Lazarsfeld, P. F., ed. (1954) Mathematical thinking in the social sciences . 

Thrall, R. M., Coombs, C. H., and Davis, R. L., eds. (1954) Decision processes. 

SHORTER INTRODUCTIONS 

Arrow, K. J. (1951) Mathematical models in the social sciences. In D. Lerner 
and H. D. Lasswell, eds. The policy sciences. 129-154. 

Bertalanffy, L. von (1951) Theoretical models in biology and psychology. 
J. Personality, 20, 24-38. 

Coombs, C. H., Raiffa, H., and Thrall, R. M. (1954) Some views on mathe- 
matical models and measurement theory. Psychol. Rev., 61, 132-144. 

Ellson, D. G. (1949) Application of operational analysis to human motor be- 
havior. Psychol. Rev., 56, 9-17. 

Grant, D. A. (1954) The discrimination of sequences in stimulus events and 
the transmission of information. A mer. Psychologist, 9, 62-68. 

Miller, G. A. (1953) What is information measurement? Amer. Psychologist, 
8, 3-11. 

Mowrer, O. H. (1954) Ego psychology, cybernetics, and learning theory. In 
Kentucky Symposium. Learning theory, personality theory, ana clinical re- 
search , 81-90. 

Spence, K. W. (1952) Mathematical formulations of learning phenomena. 
Psychol. Rev., 59, 152-160. 

REPRESENTATIVE PAPERS ON MATHEMATICAL MODELS FOR LEARNING TESTED 

AGAINST EMPIRICAL FINDINGS 

Bush, R. R., Mosteller, F., and Thompson, G. L. (1954) A formal structure 
for multiple-choice situations. In R. M. Thrall, C. H. Coombs, and R. L. 
Davis, eds. Decision processes, 99-126. 

Estes, W. K. (1954) Individual behavior in uncertain situations: an interpreta- 
tion in terms of statistical association theory. In R. M. Thrall, C. H. Coombs, 
and R. L. Davis, eds. Decision processes, 127-137. 

Flood, M. M. (1954) On game-learning theory and some decision-making ex- 
periments. In R. M. Thrall, C. H. Coombs, and R. L. Davis, eds. Decision 
processes, 139-158. 

Frick, F. C., and Miller, G. A. (1951) A statistical description of operant con- 
ditioning. Amer. J. Psychol., 64, 20-36. 

Graham, C. H., and Gagne, R. M. (1940) The acquisition, extinction, and spon- 
taneous recovery of a conditioned operant response. J. exp. Psychol, 26, 
251-281. 

Gulliksen, H., and Wolfle, D. L. (1938) A theory of learning and transfer. 
Psychometrika, 3, 127-149; 225-261. 

Hovland, C. L, and Weiss, W. (1953) Transmission of information concerning 
concepts through positive and negative instances. J. exp. Psychol, 45, 
175—182. 

Miller, G. A., and McGill, W. J. (1952) A statistical description of verbal 
learning. Psychometrika, 17, 369-396. 

Fitts, W. ( 1943; A general theory of learning and conditioning. Psychometrika, 
8, 1-18; 131-140. * 

Restle, F. (1955) A theory of discrimination learning. Psychol . Rev., 62, 11-19. 

Spence, K. W. (1936) The nature of discrimination learning in animals. Psychol 
Rev,, 43, 427-449. 

Thurstone, L. L. (1933) The error function in maze learning. ]. gen. Psychol, 
9, 288—301. 



Chapter 12 

CURRENT DEVELOPMENTS: 

I. REINFORCEMENT AND DRIVE 


I N an active field of research, with continuous interaction between ex- 
perimentation and theorizing, we do not expect the younger genera- 
tion to remain loyal to the past. In other words, we cannot expect the 
students of Thorndike to remain simply Thomdikian, those of Guthrie 
simply Guthrian, or those of Tolman merely Tolmanian. Yet we do find 
some of the preoccupations of the newer generation reflecting a back- 
ground of training under one or another of the earlier systematists. 
Contemporary developments can best be understood if they are reflected 
against a thorough understanding of the earlier systems. One role of 
history is so to familiarize us with persistent and recurrent issues that we 
gain perspective on the conflicts and quarrels of the present. Recent his- 
tory is no less important for this purpose than more remote history. Here 
lies the justification for the treatment of the separate systems in the earlier 
chapters of this book. In the process of historical development some of 
the earlier cleavages will become blurred as newer controversies take the 
center of the stage, and eventually there may be entirely new alignments. 

It is instructive to review a listing of learning theories made in 1926 
by Perrin and Klein. 1 They listed (1) pleasure-pain theory, (2) modified 
pleasure-pain (Thorndike), (3) confirmation-inhibition (Hobhouse), (4) 
congruity theory (Holmes), (5) drive or motor-set theory (Woodworth, 
Tolman), (6) completeness of response (Peterson), (7) frequency-recency 
(Watson), (8) intensity (Carr), and (9) conditioned-reflex theory 
(Smith and Guthrie). Many of the issues dealt with in these theories of 
1926 are still current, and many of the same names continue to appear in 
theoretical discussion today. But the lines of demarcation by which the 
theories could be grouped into major families had not yet become clear. 
Gestalt learning theory had not yet become influential in America, Toi- 

1 Perrin and Klein ( 1926), pages 218-243. 
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man bad not yet sharpened his views, and the positions of Hull and 
Skinner were still unborn. It may very well happen that in a few years 
from now the chapter divisions of this book will be as inappropriate to 
newer issues as the divisions of 1926 are today. The recent distinction be- 
tween the two families of theories, stimulus-response theories and cog- 
nitive theories, is already in the process of breaking down, partly be- 
cause of increased interest all around in problems of perception, partly 
because of the emergence of information-theory as a device for objecti- 
fying some aspects of cognitive processes, partly because of the awk- 
ward number of qualifications in respect to present theories, necessitated 
by the unceasing impact of new experimental data. 

Because of the rapidity of developments, a chapter of this kind can- 
not serve either as a thorough or an up-to-date review of the contem- 
porary literature. 2 All it can do is to call attention to some lines of 
development that appear promising or important, citing a few theoretical 
and experimental articles as indicative of the trends. 

THE INTERPRETATION OF REINFORCEMENT 

Within stimulus-response theories a basic issue has been the inter- 
pretation of reinforcement, dividing the systematists within the group 
into contiguity-theorists and reinforcement-theorists. These differences 
are often referred to the two patterns of conditioned response: classical 
conditioning and instrumental or operant conditioning. These paradigms 
are in turn correlated with two classes of laboratory experiment, that 
of Pavlov (“classical”) and that of Skinner (“instrumental,” “operant”). 
An experiment seldom provides a “pure case” representing a theoretical 
model, and the existence of the two kinds of conditioning experiments 
does not necessarily require two theoretical explanations of conditioning. 
Some psychologists, such as Guthrie, who adopt a theory more appro- 
priate to classical conditioning as a paradigm, believe that a response 
that occurs gets conditioned merely because stimulus and response are 
contiguous. Others, such as Hull, adopting the other paradigm, believe 
that contiguous occurrence of stimulus and response leads to condition- 
ing only when followed by another event: reinforcement. An out-and-out 
contiguity theorist will derive the results of reinforcement from his 
theory. Guthrie's solution, we recall, is to show that reward (reinforce- 
ment) merely prevents unlearning what was learned through contiguous 
association. An out-and-out reinforcement theorist will deny mere con- 

2 This purpose is served better by the reviews appearing in the Psychological Bulletin 
and the Annual Review of Psychology . 
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tiguous conditioning by arguing that some reinforcement (primary or 
secondary) is always present when learning occurs, even under the 
experimental arrangements of classical conditioning. Pavlovs dog, after 
all, was hungry and ate after salivating. A third position, advanced by 
others (e.g., Skinner, Schlosberg, Mowrer) is that the two kinds of con- 
ditioning may require two kinds of theory, one typified by learning under 
contiguity alone, the other by learning under reinforcement. These 
several possibilities all lie within stimulus-response theory. 3 

Not all reinforcement-systematists have a ‘'theory” of reinforce- 
ment. That is, one can accept the fact that certain kinds of events are 
reinforcing because they lead to the appearance of conditioned responses 
or to increase of habit strength, and let it go at that. 4 A learning theorist 
such as Skinner or Spence who is not interested in the physiological 
substratum of learning does not deny that learning goes on in the body. 
He may simply turn the problems of the physiology of learning over to 
someone else. So, too, the theorist interested in interrelationships among 
learning data may not concern himself about the intimate relationship 
between reinforcement and physiological needs. The commonest inter- 
pretation of reinforcement, however, can be traced back to Sherrington's 
preparatory and consummatory response. It now goes by the name of the 
drive-reduction theory. We have, of course, met this before, particularly 
in Hull's theory. The basic sequence is that deprivation produces a state of 
need in the organism. This need finds representation psychologically as a 
tension state known as drive, a state that energizes into action and produces 
stimuli associated with it. Thus the food-deprived animal shows the 
restless activity characteristic of the hunger drive, and the stimuli associ- 
ated with the drive lead to food-seeking. This restless state of tension 
is brought to an end when food is ingested, because food is “consumma- 
tely ” That is, food restores the bodily depletion, reduces drive, and re- 
moves the associated stimuli, bringing the cycle to an end. Primary rein- 
forcement is interpreted as related to the last portion of the deprivation- 
activity-satiation cycle. That is, a primary reinforcing state of affairs is one 
that reduces drive through satisfying need. We shall see that not all re- 

3 The issues are met so differently by cognitive theorists that it would be very diffi- 
cult to treat their views in this section. 

4 The term “reinforcement” may be used broadly or narrowly. We have favored 
the broadest and most nearly neutral definition of reinforcement, permitting it to 
refer to the presentation of the stimulus that serves as the unconditioned stimulus 
in classical conditioning or as the rewarding stimulus in instrumental conditioning. 
This broad use permits speaking of the number of reinforcements necessary before 
a conditioned response appears in classical conditioning. MacCorquodale (1955), 
page 48, recommends a narrower meaning, for he requires an antecedent response 
before it is permissible to speak of reinforcement. More specialized meanings of 
reinforcement in what follows will be clear from the context. 
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inforcement theorists hold that drive reduction is necessary for learning, 
yet drive reduction remains the basic paradigm for explaining why a 
reinforcing event is reinforcing. 

In the recent literature of reinforcement theory, secondary rein- 
forcement has come to play an increasing role. Even in Hull’s (1943) 
drive-reduction interpretation, secondary reinforcement did not have to 
be drive-reducing. Its effectiveness, however, was said to wear off unless 
it was ‘sustained by association with primary reinforcement. Hence the 
mechanism of primary reinforcement remained of central interest. 

In the early 1950’s a number of investigators began to report the 
results of experiments designed to study the reinforcement process more 
directly than it had been studied before. Suspicion became aroused that 
much that had been classed as primary reinforcement was really second- 
ary. For example, it takes a long time after eating before the food pro- 
duces nutritive changes in the body that will alter materially the tissue 
state associated with food depletion. Yet the reinforcing action is more 
prompt than this. Possibly even food — the standard incentive for “pri- 
mary” reinforcement — is actually yielding relationships based on secondary 
reinforcement, or, as an alternative interpretation, possibly need reduc- 
tion is not essential even to primary reinforcement. These doubts and 
queries have led to a variety of experiments and a variety of interpre- 
tations. 

v 

Sheffield's Prepotent-Response Interpretation 

In order to test the need-reduction theory of reinforcement, Shef- 
field and his collaborators designed experiments in which the incentive 
used as the reinforcement could be responded to but without satisfying 
the-physi-ological need to which that incentive was appropriate. Sheffield 
and^Kbby (1950) showed that the non-nutritive sweet taste of saccharin 
could serve as a reward in rat learning, even though no metabolic relief 
of hunger was involved. Later experiments showed that the reinforc- 
ing value of a sweet taste depended primarily on the amount of evoked 
consummatory activity (Sheffield, Roby, and Campbell, 1954). Sheffield, 
Wulff, and Backer (1951) showed similarly that naive male rats would 
learn to run and leap over hurdles in order to copulate with receptive 
females even though no ejaculation was permitted. The rate of learning 
depended upon the amount of copulation, so that prepotent act of copu- 
lation was important in the reinforcement sequence, even though the 
release of physiological sexual tensions through ejaculation did not oc- 
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cur. These experiments seriously question the importance for reinforce- 
ment of satisfying primary drives either through making up depletions 
(as in satisfying hunger) or through relieving excess tensions (as in 
discharges from the male sex organs). 

Sheffield, in offering the theory of prepotent responses, is very close 
to Guthrie, whose views he has often defended. He has used these experi- 
ments, in fact, in support of a contiguity interpretation of learning. 5 

Neal E. Millers Theory of Stimulus-Intensity Reduction 

Because, for Miller, any strong stimulus can become a drive and im- 
pel to action, his drive-reduction theory of reinforcement is really a 
stimulus-intensity-reduction theory. 6 

We have learned that it is too much to expect to find a crucial 
experiment that will resolve our disputes once and for all. Millers reply 
to Sheffield’s experiments is contained in the results of a series of experi- 
ments in which the consequences of food in the mouth can be compared 
with food in the stomach as reinforcing agents. If food injected into the 
stomach is reinforcing, this is obviously a threat to Sheffield’s prepotent 
response theory. For example, using the stomach-fistula method, Miller 
and Kessen (1952) taught rats a simple T-maze when milk followed 
choice of the correct side and normal saline solution followed choice 
of the incorrect side. When the incentive substances were received by 
mouth the animals learned faster, but they also learned when the rein- 
forcement was received by stomach (Figure 60 ). 7 The authors believe 
that their results support the drive-reduction theory and refute Sheffield’s 
prepotent-response hypothesis. That learning was faster by mouth would 
be expected from the support given to the learning via secondary rein- 
forcement. 

In developing his form of drive-reduction theory. Miller has strug- 
gled particularly with the problems created by learned drive (fear) based 
upon pain. The empirical situation is well established. It is possible to 
shock a rat in the white compartment of a two-compartment box, and 
arouse fear. Placed again in the box, without shock, he shows all the 
signs of a “conditioned” fear. Given an opportunity to jump a hurdle, 
turn a wheel, or in some other manner find his way out of the compart- 
ment to a “safe” one in which he has not been shocked, he ‘learns” this 

* Sheffield (1951). 

« Miller and Dollard (1941), Dollard and Miller (1950), Miller (1951a) (1951b). 

7 Saccharin has been shown to reduce subsequent food consumption when taken 
by mouth, but not by stomach fistula, by Miller, Murray, and Roberts (1955). 
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mode of escape. The drive reduction that serves as the reinforcement is 
the reduction of fear. 8 To hold consistently to a drive-reduction theory of 
reinforcement, it is necessary to account for the rat's fear of the compart- 
ment in which he was shocked. It does not do to fall back upon classical 
conditioning and to account for the rat's agitation as a substitution of the 



FIGURE 60. Correct choices to milk side when fed by mouth immediately and 
with delay, and when fed by fistula. Miller and Kessen ( 1952), page 560. 

previously neutral compartment for the shock that caused the agitation 
in the box-shock trials. Miller recognizes that in order to be consistent 
he must assume that the agitated responses are reinforced as all other 
responses are, namely, by stimulus reduction, hence in this case by 
termination of the painful stimulus. He adopts this interpretation of what 
happens, though he confesses some uneasiness about it. 9 

Because strong stimulation is not necessarily based upon a need 
state. Miller's theory has a genuine advantage over Hull's in that it 
yields a unified interpretation of primary and learned drives and of pri- 
mary and secondary reinforcement. In his words: 

Thus the basic mechanism of motivation (strong stimulation) is the same 
for primary and learned drives, and the basic mechanism of reinforcement (a 
reduction in strong stimulation) is the same for primary and learned rewards. 10 

8 Miller (1948a). See also Brown and Jacobs (1949). 

9 Miller ( 1951a ) , page 375. 

10 Miller (1951b ), page 440. The attempt to provide a demonstration of a learned 
drive based on hunger has not proved successful, Myers and Miller (1954). 
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Mowrer s Two-Factor Theory 

Mowrer early saw the possibility that much learning could be ac- 
counted for on the basis of reinforcement motivated by acquired fear, 
or by anxiety, as he then called it. His was the first clear statement of 
the anxiety-reduction or fear-reduction theory of reinforcement. 11 Re- 
flecting upon the same problem that bothered Miller, the problem of how 
fear is acquired via pain, Mowrer later came to reject the drive-reduc- 
tion theory, and returned to the familiar stimulus-substitution theory of 
classical conditioning. But he saved the drive-reduction theory for instru- 
mental learning, and thus came to espouse a two-factor theory. 12 

The interpretation of learning as a dual process was not new. We 
met it in Thorndike’s associative shifting and learning under the law of 
effect. We met it also in Skinner’s two types, Type S and Type R. A some- 
what similar proposal had been made by Schlosberg (1937). Mowrer’s 
espousal of the dual theory was important, however, because it repre- 
sented a conversion to the position by one who had been rather closely 
identified with the drive-reduction unified theory. He thus set himself 
in direct opposition to the views of Miller, although both men were pro- 
ducing related experimental results. 

Mowrer, in elaborating the two-factor theory, has used the term 
sign learning for simple conditioning of the stimulus-substitution or as- 
sociative-shifting variety, and solution learning for trial-and-error learn- 
ing under reinforcement. He writes: 

In order to make a two-factor theory work, “conditioning,” it now appears, 
must be restricted to the process whereby emotions, meanings, attitudes, ap- 
petites, and cognitions are acquired, with “solution learning” applying to all cases 
of overt instrumental habit formation. 13 

Mowrer believed that he found support for his theory in the differ- 
ences in function between the autonomic and the central nervous system. 
Emotional responses, of which fear is an example, are largely involuntary 
and are chiefly under the control of the autonomic system. Problem-solv- 
ing responses, instrumental responses, are largely voluntary, skeletal, 
under the control of the central nervous system. Hence it would not be 
surprising if in the course of evolution different principles of learning 

11 The early paper is Mowrer (1939). It is reproduced with later comments by its 
author in Mowrer ( 1950). He and Miller have long been concerned about very similar 
problems, and while they were together at Yale they collaborated on a demonstration 
apparatus to exhibit the learned behavior in which they were both interested, Mowrer 
ana Miller (1942). 

12 Mowrer (1947) (1950) (1951) (1954b). 

13 Mowrer (1951), page 352. 
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might have developed, their dissimilarities showing in the differing 
activities of these two systems. 

The issue is sharp between Mowrer ’s interpretation and Millers. 
The latter, holding to his unified theory, runs somewhat up against the 
intuitions of ordinary experience. That is, we seem ordinarily to be ap- 
prehensive about the expected onset of pain, and it is hard to believe 
that our fears are strengthened because pain subsides quickly. In order 
to refute the theory that conditioning of fear depends upon the termina- 
tion of pain, Mowrer and Suter 14 performed an experiment in which a 
flickering light served as the conditioned stimulus. The subject could 
avoid a shock by running to the opposite end of the apparatus during 
the first 5 seconds of the persisting conditioned stimulus. If however, the 
shock came on (as it did at the end of 5 seconds) there was no way 
to escape it, and it continued for 10 seconds. The two conditions of the 
experiment were (1) terminating the conditioned stimulus at the onset 
of the shock, and (2) terminating the conditioned stimulus at the end of 
the shock. Mowrer argued that Millers theory should produce greater 
conditioning under condition (2), since the end of shock is reinforcing; 
whereas his theory would predict no difference in the two methods, since 
coincidence with onset is common to both, and is all that matters. The 
results confirmed Mowrer’s deductions from his theory, in that there was 
no significant difference between the two groups. 

Miller (1951a) rejected the interpretation of the Mowrer and Suter 
experiment, basing his case in part on criticisms by Kendler ( 1951 ) and 
Sheffield (1951), and on some results of Sullivan (1950) that appear to 
support his own theory. We must leave this issue as unresolved. 15 

Mowrer s theory is currently undergoing revision, but the new theory 
is too recent for critical exposition here. The two factors of the original 
version have given way to two new factors: incremental fear conditioning 
(secondary motivation), and decremental fear conditioning (secondary 
reinforcement). In the new theory, all learning is sign learning. 16 

Spence’s Interpretation of the Role of Reinforcement 

In his recent formulation of his position in the Silliman Lectures at 
Yale, Spence (1955) has reiterated his lack of active interest in how re- 
inforcement works, for example, whether or not it is drive-reducing or 

** Reported in Mowrer (1950), pages 278-287. 

16 For additional evidence on Mowrers theory, see Mowrer and Solomon (1954) 
and Mowrer and Aiken (1954). On Millers side, see Zeaman, Deane, and Wegner 
(1954). 

** Mowrer (1955). 
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need-reducing. He has preferred all along to adopt the empirical law 
of effect (in this agreeing with Skinner) and then to deal with quanti- 
tative relationships, using (unlike Skinner) an approach via intervening 
variables. 

Because Spence's views are commonly held to be close to those of 
Hull, he is careful to point out what differences there are between his 
general theoretical preferences and those of Hull. He makes four points: 

1. In the first place, the formal hypothetico-deductive style, as used 
by Hull and others (1940), did not appeal to him as appropriate at this 
stage of learning theory. Along with Bergmann 17 he showed that Hulls 
theoretical successes lay rather in the intervening-variable approach, and 
it was the article in which they pointed this out ( and the extensive cor- 
respondence then being carried on between Hull and Spence) that led 
Hull in his 1943 book to characterize his method in terms of intervening 
variables and to borrow this term from Tolman. 

2. Spence felt that Hull went farther than necessary in attempting 
to give neurophysiological specification to his intervening variables. 
Although Spence prefers to remain neutral with respect to physiological 
interpretations, he would have no objection, of course, to genuinely fruit- 
ful physiological concepts. 

3. Whereas Hull adopted reduction in drive strength (or reduction 
in drive stimuli) as essential to reinforcement, Spence did not commit 
himself on these issues, holding instead merely to the empirical law of 
effect. 

4. Finally, Spence prefers to develop his position step-by-step, with 
his ventures into theorizing always very close to data. Hull was rather 
more willing than he to make guesses in the form of “postulates" on the 
basis of minimum empirical evidence. Spence's objectives are stated thus: 

That is, the initial objective is to develop a set or system of concepts and 
postulated interrelationships which, in combination with the different initial and 
boundary conditions specific to each experimental situation, will provide for the 
deduction of the several kinds of conditioning laws that have been obtained. It 
is hoped further that this theory, along with the necessary additional composition 
rules, will permit one to account for more complex learning phenomena. 18 

Spence indicates two reasons why, over the years, he was not actively 
interested in the mechanism of reinforcement. The first was his interest in 
quantification, which he repeatedly pointed out did not require a theory 
of reinforcement. 19 The second was the difficulty that showed up in 

Bergmann and Spence ( 1941 ) . 

18 Spence (1955), quoted from manuscript (Lecture 3). 

19 E.g., Spence ( 1952b). 
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experiments on latent learning of ever finding an experimental situation 
totally free of reinforcing events. It was the effort to explain the various 
things happening in latent-learning experiments that led to an interpre- 
tation of reinforcement important in Spence’s contemporary theorizing. 20 

The contemporary theory accepts the association of stimuli and re- 
sponses through their contiguous occurrence. Events at the goal are 
important. The conditioning that takes place in the goal-box is reinforced 
(as Pavlov’s dog’s salivation is reinforced) and no special theory of this 
reinforcement is presented. That is, for conditioning to occur, the sub- 
ject must respond to the goal-object either by consuming it, or by 
perceiving it. This reinforcing act produces the conditioning of frac- 
tional goal responses (r G ) to stimulus cues throughout the response 
chain of the activity prior to the goal, and these conditioned responses 
(f G ) and the stimuli they produce (s G ) serve as important integrators 
of behavior, as they do in Hull’s theory. The instrumental responses 
themselves, in this chain, are strengthened merely because they occur, 
and not because they are reinforced through some act at the end of the 
chain. The reinforcing act at the end of the chain strengthens the r G ’s, 
and these affect earlier links of the chain as motivators rather than as 
contributors to habit strength. Thus the r G -s G mechanism becomes iden- 
tified in Spence’s theory with Hull’s incentive motivation (K) rather 
than with habit strength (H). Differences in reaction potential of re- 
sponses at different distances from the reinforcement end of the behavior 
chain are thus due to differences in the K factor rather than to differences 
inH. 

Because reinforcement may be necessary for classical conditioning 
and is not decisive for instrumental learning, Spence toys with the no- 
tion (without committing himself to it) that he may have a two-factor 
theory just the opposite of Mowrer’s: 

It is interesting to note that if one were to adopt a theory that reinforcement 
(e.g., need reduction) plays a decisive role in the acquisition of habit strength of 
classical conditioned responses but not in the case of instrumental conditioned 
responses, one would have a two-factor theory that is exactly the opposite of the 
well known two-factor theory espoused by Schlosberg, Mowrer and others. 21 

Because Spence’s theory is becoming a major competitor to other 
theories, and is already leading to relevant experimental work, it is de- 

20 The theory has been developing over some years, and has been mentioned in 
the writings of some of Spence's students (e.g., Kendler, 1946). It was first elaborated 
more^fully in Spence, Bergmaim, and Lippitt (1950). See also Spence (1954a) 

21 Spence (1955), quoted from manuscript (Lecture 5). 



CURRENT DEVELOPMENTS: I 417 

sirable to see an example o£ how it works out in quantitative studies. 
The theory does not, in fact, commit Spence to an interpretation of the 
mechanism of reinforcement for classical conditioning. It does, however, 
require that he demonstrate that the K factor (based on anticipatory goal 
responses) acts as drive does upon excitatory potential rather than as 
habit strength does. 

If K acts as a drive, it will have a multiplicative relationship to H 
in the determination of E. That is, other things being equal, 

E = HxK (1) 

where E is reaction potential 
H is habit strength 

and K is the incentive motivator based on the fractional goal response 

Equation (1) above is written in terms of intervening variables. 
What are the experimental variables correlated with these intervening 
variables? The diagram in Figure 61 shows how experimental variables 
and resulting behavior are related to the intervening variables. 
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FIGURE 61. Relation of experimental variables to intervening variables in the 
determination of K as a multiplier of habit strength. Spence (1955), Lecture 5. 

Habit strength is assumed to be related only to N, the number of 
response occurrences, whether goal responses or not. Hence we are in- 
terested primarily in the determiners of K, if we are to understand E. 
These are as follows: 

N g = the number of classical conditioning trials, i.e., the number 
of times the subject enters the goal-box and responds to 
(consumes or sees) the goal-object. 

W G = amount of goal substance consumed (number of pellets, 
weight or volume of food). 

X x . . . X N = a number of as yet unknown variables that presumably de- 
termine the vigor of the consummatory response, e.g., the 
sweetness of the object, the amount of sucking effort required 
to obtain the object in the case of liquids, possibly the hard- 
ness of the goal-object, and so on. 
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The presumption is that experimental variations of reinforcing agents 
can be shown, by way of properly devised experimental tests, to affect 
the nature of K. If the habit strength of r c; is increased, K should be in- 
creased. The relevant K may also be affected by the vigor of the par- 
ticular r G that is strengthened, that is, the particular component that 
gets conditioned from out of all the activity going on in the goal box. 

The theory implies that the differences in reaction potential of re- 
sponses at different distances from reinforcement will be due to differ- 
ences in K rather than in H, and this theory leads to a number of de- 
ductions. In order to see how these deductions are submitted to test, we 
shall follow through one of Spence’s deductions. 

The theory, as it takes more specific form, assumes that K and D 
are additive (not multiplicative, as for Hull). This choice of addition 
rather than multiplication is not purely arbitrary, and if the wrong choice 
were made it would be corrected by the experimental consequences. 
The reasons for the choice need not now concern us, for the results will 
provide an indirect verification. The assumption may be stated: 

E = H(D + K) (2) 

Now we turn to an experiment. Suppose we are interested in die 
running speed of a rat in a 1-foot section of a straight runway, the 
section beginning 1 foot from the start, and 4 feet from the goal. This 
running speed is either reinforced as a habit by what happens in the goal 
box (Hull), or it is sustained by D and K, its habit strength being inde- 
pendent of events in the goal box (Spence). If the section, the same 
distance from the start, is 10 feet from the goal, we expect somewhat 
less reinforcement (and also somewhat less r G and hence K), so that 
running speed will be reduced there, as compared with the section be- 
ginning 4 feet from the goal. Expressed thus crudely, the Hull and Spence 

theories seem to lead to the same deduction. But the importance of a 
quantitative theory is well illustrated here, for the quantitative deduc- 
tions of the two theories differ. 

Expressed in reinforcement terms (Hull’s 1943 theory), the excita- 
tory potentials for shorter and longer distances from the goal will be: 

E s = H s x D (3) 

E l = H l x D (4) 

Subtracting the two equations, we find: 

E s - E L = D x (H g - H l ) (5) 

Equation (5) shows that the differences in excitatory potential, at 
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a given level of practice, will be a function of drive. Because running 
speed is assumed to be a linear function of excitatory potential, this 
could be tested experimentally. If (in appropriate units) drive level 
doubles, running speed should also double. This would be a simple 
Hullian inference. 

If we go through similar steps in accordance with Spences theory, 
we come out with a different inference. According to Spence: 

E s = H x (D + K s ) (6) 

E l = Hx(D + K l ) (7) 

Subtracting these two equations, we have: 

E s -E l = Hx (K s — K l ) (8) 

Equation (8), by contrast with Equation (5), shows that at differ- 
ent levels of practice (represented by H) the differences in reaction 
potential between the shorter and longer path will be independent of 
drive level, and a function solely of differences in K. 

The experimental test to determine whether Equation (5) or Equa- 
tion (8) holds is straightforward. Rim the rats with two delays under 
two drive levels, holding everything else constant, and see whether the 
differences in running speed under the two delays diverge or remain 
constant with changes in drive. If they diverge, the change with drive is 
more likely to be due to H, as implied in Hull's theory and Equation 
(5). If the differences remain constant, support is given to Equation 
(8) and to the interpretation that running speed depends on K rather 
than H. A preliminary test run in Spences laboratory yielded the results 
shown in Figure 62, confirming his views, as implied in Equation (8). 
The figure shows the results for 9 rats per group on the first daily trial 
over the last three training days (days 10, 11, and 12) for the four 
groups of subjects. 

We shall meet some other derivations from this theory in the next 
chapter. For the present, this suffices to indicate how the theory is 
translated into experimentable form. 

How can Spence’s theory best be characterized, in relation to other 
reinforcement theories? Let us quote his own words: 

This theory is, then, a reinforcement theory so far as excitatory potential is 
concerned; that is, the presence or absence of a reinforcer, and differences in its 
properties when present, do make a difference in the strength of the instrumental 
response. It is not, however, a reinforcement theory in the traditional sense of the 
term, for the habit or associative factor is not assumed to vary with variations in 
reinforcement. 22 

22 Spence (1955), quoted from manuscript (Lecture 5), 
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FIGURE 62. Test to determine whether or not drive level affects amount of dis- 
crimination between two delays. Because die differences at low and high drive remain 
constant, Spence's theory that D and K are additive is supported. Spence (1955), 
Lecture 5. 


Seward’s Two-Moment Theory of Motivated Learning 

In a series of theoretical papers beginning in 1950, 23 Seward has at- 
tempted to unravel the problems of drive and goal response in the learn- 
ing of instrumental behavior under both appetitive and aversive drives. 

The early statement of his theory had very much in common with 
Spence’s recent interpretation of reinforcement because the emphasis 
was upon the fractional goal response as a mediator, its role being facili- 
tation rather than association . This is roughly what Spence came out 
with: the incentive motivation is a component of drive, and hence acts 
as a multiplier of habit strength rather than a producer of it. Seward 
took as his point of departure Hull’s (1950a) revised postulates and 
Spence’s (1947) interpretation of delayed-reward learning. Because he, 
like Spence, was worried about latent learning, it is not too surprising 
that with these common starting points they should come out with 
similar theories. 

Seward’s general position is as follows: 

23 The chief papers in the series are Seward (1950) (1951) (1952) (1953). 
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1. All learning is based on a simple law of association, without 
reference to reinforcement. 

2. Conditioning of the fractional goal response is a special case of 
such learning, limited by the amount and delay of reward. 

8. This conditioning of the fractional goal response results in a con- 
struct (at first called tertiary motivation , but the term was later dropped) 
which enters into the equation for s Er as a multiplier along with D and 
sHr. 

Because this multiplier facilitates the act in progress, instrumental 
activity is affected by what happens at the goal. 

The modifications of Hull's position by Spence and by Seward are 
readily seen to be very similar. Drive-reduction reinforcement of instru- 
mental responses is dropped by both, and fractional goal responses enter 
as components of motivation. Beyond that the theories differ. 

Seward (1952) ran into trouble in applying his theory symmetri- 
cally to learning based on either appetitive or aversive drives. 24 He intro- 
duced a modification in the form of a two-moment theory of motivation. 
As he reviewed the events in a cycle of motivated behavior, it appeared 
to him that most activity begins with something aversive and ends with 
something satisfying. This is, of course, coherent with a common inter- 
pretation of the need-drive-incentive pattern, for drive is conceived as a 
state of disturbed homeostasis, while the incentive restores the equilib- 
rium. The problem reduces, then, to two subproblems: how drives are 
acquired and how goals are acquired, with the implications for appe- 
titive and aversive learning. Through ordinary conditioning processes, a 
drive surrogate ( rs ® ) is acquired as well as a goal surrogate ( rsa ) • These 
are the two moments of motivation which are now said to contribute to 
stimulus-intensity dynamism of a motive (V m ), as follows: 

V m = a(rs G + rs D ) + k 

This V m is a multiplier of habit strength, as in Hull's equations. 25 

The advantage of the two-moment theory is that relative weights can 
be assigned to rs G and to rs D to account for appetitive and aversive prob- 
lems. In problems such as escape from pain and pain avoidance, empha- 
sis is placed upon rs D , with rs G playing a minor role. The interpretation of 

24 Spence has not yet dealt specifically with escape and avoidance learning in the 
development of his theory. 

25 All varieties of stimulus, whether produced by external events, by the organism’s 
movements, or by drive, must have both cue aspects and stimulus-intensity dynamism 
aspects. Hence a great deal of arbitrariness enters into terms such as D, K, and V in 
Hull’s system or any derivatives from it. Seward could have chosen to tie his terms to 
K, as Spence does, instead of to V. 
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experiments such as those of Miller (1948a) and Brown and Jacobs 
(1949) rests almost entirely on acquired fear, a form of rs D . But in the 
case of instrumental learning for reward, as in food-rewarded hunger, 
there has been little success in finding learned drives (rs D ), 26 so that 
most of the burden is taken on by learning related to the goal response 
(rso). The separation of these two moments of motivation is a very 
natural development from familiar interpretations of drive and goal, and 
it appears to open up a useful line of investigation in harmonizing learn- 
ing under reward and punishment 

Concluding Remarks on Reinforcement 

We come out with post-Hullian interpretations of reinforcement as 
follows: 

1. Single-factor theories 

a. Contiguity only (Sheffield, Seward) 

b . Drive reduction only (N. E. Miller) 

2. Dual theories 

a . Drive reduction for instrumental acts only (Mowrer) 

b. Drive reduction for classical conditioning only (Spence) (?) 

It is evident that agr^ment has not yet been achieved! 

&EEDS, DRIVES, AND INCENTIVES 

We have seen that the problems that have arisen in connection with 
the reinterpretation of reinforcement are also problems in the interpreta- 
tion of motivation. 

The need-drive-incentive pattern, in its simplest form, asserts that 
physiological needs are created by deprivation (or by tissue injury, as 
in noxious stimulation),! and that these needs give rise to drives. A drive, 
in turn, goads to activity, and may also guide activity, until an incentive 
(goal-object) is encountered. Response to the incentive reduces the 
drive by satisfying the need, that is, by making up for the deprivation 
or by getting rid of the noxious stimulation. It is possible to hold to this 
general pattern of events as a model of the motivational cycle whether 
or not one accepts a reinforcement theory of learning. While this pattern 
serves as a crude approximation to what actually happens, recent studies 
show that the details are very complex indeed, and there are some who 
have come to believe that, as a motivational model, the need-drive- 
incentive pattern is actually misleading. 

28 Seward (1951), Myers and Miller (1954), 
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Drives as Correlates of Physiological Needs 

We may define “need” as the physiological state of deprivation or of 
tissue injury, and “drive” as the psychological consequence of this state. 
We know, in general, that the longer we have abstained from food, the 
hungrier we get, and that rats deprived of food tend to run faster in an 
activity cage than well-fed rats. As we examine the details of the need- 
drive correlation, however, we find that we cannot assume a one-to-one 
correspondence between need level and drive level. 

We have already seen how Hull (1952a) introduced an inanition 
factor to account for the fact that as animals become weakened by de- 
privation their food-seeking drives are also weakened. Prolonged starva- 
tion is thus one exceptional condition that disturbs the relationship be- 
tween need and drive. 

Other qualifications of the need-drive correlation are shown by the 
experiments on artificial drive reduction, where need is not satisfied (as 
in saccharin feeding), or where a common amount of need reduction 
leads to varying amounts of drive reduction (as in the fistula-feeding 
experiments). 27 

There must be some sort of “feedback” mechanism within the nervous 
system that tells the organism when it has had enough to eat or to drink, 
prior to the metabolic assimilation of the food or liquid. The search for 
such mechanisms helps to throw light upon the relationship between 
need and drive. We have met the suggestion by Heron (1949) that dis- 
tention of the stomach, whether by water or by food, seemed to be one 
basis for drive reduction (somewhat independent of need reduction). 
If Herons theory were true, it would not deny the usual correlation be- 
tween need and drive, for usually the stomach distention would be 
produced by substances appropriate to the satisfaction of need. But 
doubts concerning the generality of Herons results have been raised by 
Miller and Kessen (1954), who inserted a small balloon on the end of 
the stomach fistula of rats prepared as in their previous experiments. 
They could now compare the behavior of the rats when the stomach was 
distended by milk with their behavior when the stomach was distended 
by the inflated balloon. They found in experiments with a T-maze that 
the results did not support Herons claims. The milk served as a reward, 

27 In addition to the experiments by Miller and Kessen (1952), already cited, we 
may note the related experiments by Hull, Livingston, Rouse, and Barker (1951), 
Kohn (1951), Berkun, Kessen and Miller 1952). The experiments have been re- 
peated for the thirst drive by Sampliner and Woodrow, in Millers laboratory (unpub- 
lished). 
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and led to the choice of that arm of the T that was followed by milk 
injection; the inflated balloon served as a punishment, and led to avoid- 
ance of the side that led to balloon inflation. Hence stomach distention 
alone is not the condition resulting in drive reduction of either hunger 
or thirst. 

Another feedback mechanism has been located in the region of the 
hypothalamus, clearly affecting the thirst drive. Andersson (1953) showed 
that the injection of a saline solution into the third ventricle of the brain 
(the region of the hypothalamus) produced excessive drinking in goats. 
The experiment has been repeated and extended in Millers laboratory 
with cats as subjects. 28 Hypertonic saline solution (2 per cent NaCl) in- 
jected into the ventricle produced excessive water consumption, while 
pure water produced a minimum of drinking. Isotonic salt solution fell 
between the two. The amounts injected were very small (0.15 cc. of 
sterile solution). This suggests a sort of “detector” mechanism in the 
nervous system that may prove to give important clues as to how the 
organism . adjusts its liquid input to its needs. 

JEfrives as Goads to Activity 

It has usually been assumed that restless activity is associated with 
heightened drive, and the activity-cage performance has sometimes been 
used as an indicator of drive level. The relationship has been studied 
anew by C ampbell an d Sheffield (1953). They kept twelve rats in a 
constant environment with unvarying external stimulating conditions for 
seven days, four days with food always present and three days deprived of 
all food. In such a constant environment there was very little change in- 
general activity as a result of food deprivation. When, however, sounds 
were introduced or changing visual stimuli were provided, the activity 
level increased, and the amount of increase was noticeably affected by the 
level of deprivation, as shown in Figure 63. The authors conclude that 
hunger drive produces lower thresholds of response rather than internal 
stimulation to activity. The result is plausible in view of the well-estab- 
lished findings that bodily deficiencies result in selective food preferences, 
a result that might come about through a lowering of thresholds for 
specific kinds of stimulation. Total prolonged absence of food would 
correspondingly lower thresholds for almost any kind of stimulation. 

28 Richter and Southwick (unpublished). 
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FIGURE 63. Effects of deprivation on “spontaneous” activity as compared with 
activity when an environmental change is introduced. Campbell and Sheffield ( 1953), 
page 321. 


Drives as Cues to Differential Response 

In addition to whatever drives do to reduce thresholds or to stimu- 
late activity, they also help to direct activity through providing various 
differential cues. For example, an animal can be taught to turn one way 
when hungry, another way when thirsty. Bailey and Porter (1955) have 
taught cats to perform one instrumental response to get water when 
thirsty, and another to get food when hungry. Eight of nine cats were 
highly successful in making the discrimination. 

Bailey (1954) performed an experiment with rats to determine 
whether or not hunger or thirst could provide differential cues for a 
discrimination rewarded not by food or water but by escape from intense 
light. The rats had to press one of two panels in order to turn off the 
light. Which panel was correct was determined by which drive was 
active. Thus the drives served as cues exactly as if they were different 
figures on the panels. Three groups were able to discriminate ( a ) thirst 
from hunger, ( b ) thirst from satiation, and (c) hunger from satiation. 

In the same series of experiments, Bailey also used the presence or 
absence of a 10,000 cycle tone as a cue for the differential response to the 
panels. If the tone was presented shortly before the time of response, re- 
sults were very similar to those with drive discrimination. If the tone 
began to stimulate the rat a long time before he was placed in the 
apparatus, he did not learn. The difference between the long-delay and 
short-delay tone is used to interpret the difficulty sometimes found in 
drive-discrimination experiments, because drives are long-persistent 
stimuli. The argument that long-persistent stimuli are hard to discrimi- 
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nate is weakened by the finding that drive discriminations were no more 
difficult than short-delay tone discriminations. Supplementary conjec- 
tures are required, possibly about cue distinctiveness, if the original argu- 
ment is to hold. 

Drive Interaction and General Drive Level 

Though drives provide differential cues, all drives may have some- 
thing in common. Each may contribute its share to a general drive level, 
as implied in the theories of Hull and of Spence. This general drive level 
(D) is the multiplier used in their equations to produce reaction poten- 
tial from habit strength. The theoretical problem of a general drive level 
contributed to by specific drives is subject to a variety of experimental 
approaches. 

Verplanck and Hayes (1953) conducted a series of experiments on 
food and water deprivation in rats in which the animals later, under 
conditions varying from experiment to experiment, had an opportunity to 
eat and drink. They found a very complex picture of drive interaction 
as measured by eonsummatory activity. For example, the amount of food 
eaten by water-deprived rats is some 60 per cent of the amount eaten 
by those which have had continuous access to water, while the amount 
of water drunk by food-deprived rats is only 40 per cent of that drunk by 
animals which have had continuous access to food. If the animal is 
allowed to drink, after being deprived of water for 22 hours, so that its 
water needs are satisfied, it is motivated to eat about as much food as if 
it had been deprived of food for the same period. Similarly, termination 
of a fast, without water deprivation, leads to water consumption ap- 
proaching that of water drunk after a like period of water deprivation. 
The authors believe these results to be significant for studies of latent 
learning, which often try to distinguish too sharply between “hungry” 
and “thirsty” rats. 

Considerable success has been achieved in studies originating at 
the University of Iowa, and largely carried out there, which treat some 
generalized human motives according to the intervening variable (D), 
and then make predictions as to consequences for conditioning, dis- 
crimination, and verbal learning. Most of the studies have used a meas- 
ure of “anxiety,” based on Taylor's anxiety scale. 29 Others have used a 
.measure of achievement motive in somewhat the same way. 30 Farber 

29 A number of these studies have been listed in Footnote 32, Chapter 9. Theoretical 
summaries, in relation to drive theory, can be found in Farber ( 1954), Spence ( 1955). 
so E.g., Hurley ( 1953 ) . Cited by Farber ( 1953 ) . 
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(1953) points out that possibly achievement scores should not be used 
as indicative of drive level, but rather as indicative of a general response 
tendency influencing performance in a variety of situations. He suggests 
that it might possibly be referred to as an “achievement-oriented habit/’ 

In his very thoughtful review of anxiety as a drive state, Farber 
( 1954 ) rightly points out the successes that have been achieved through 
treating anxiety parsimoniously as generalized drive (D). He notes, 
however, that anxiety must be a special kind of D; it could not be 
claimed that just any heightened drive would lead to precisely the same 
consequences. His doubts about the status of achievement motive are 
perhaps equally applicable to anxiety. The detailed studies which show 
how test-measured anxiety acts as a cue or specific set of response ten- 
dencies (in addition to its enhancement effect as generalized drive) have 
not yet been reported. 31 

In developing the relationship between reaction potential, drive, and 
habit formation, the later theorizing of both Hull and Spence limited 
the effect of drive (D) to an influence upon reaction potential, not 
upon habit strength. An experiment by Campbell and Kraeling (1954) 
calls this into question. Rats trained to run in a straight alley under 
high and low strengths of hunger drive were tested in extinction trials 
according to two measures: resistance to extinction (measured by trials) 
and amplitude (measured by running speed). The Hull-Spence posi- 
tion was confirmed when resistance to extinction was used as a criterion, 
but with amplitude there was greater response strength during extinc- 
tion with stronger training drive level, regardless of the drive level dur- 
ing extinction. This experiment is merely one more showing how un- 
certain are our generalizations based on the drive-reinforcement para- 
digm. 32 

The Positive Role of Incentives 

The words “goal,” “lure,” “incentive” seem to imply some kind of 
directional orientation toward a state or condition that is attractive. We 
say that organisms strive toward goals. By a curious chain of circum- 

31 Spence (1955) reports some experiments to determine whether anxiety t of this 
kind acts as if it were a generalized excitement or whether it acts more as if it merely 
lowered the threshold for emotional expeiiences. Experiments in Eysenck's laboratory 
in London have shown selective effects for different personality types — contrasting a 
“neurasthenic” type with a “hysterical” type. 

32 Campbell and Kraeling point out that their experimental result also criticizes 
Hull's use of resistance-to-extinction and amplitude of response as alternative meas- 
ures of reaction potential, for the functional relationship of experimental variables to 
response differ depending upon which measure is used. 
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stances, this rather primitive nature of goal-experiences was largely 
lost sight of in the theorizing of the last decade or two. To be sure, it 
could not be ignored entirely, but striving and goal-seeking had to be 
derived from more primitive principles before the familiar facts could 
be recognized as valid. The circumstances that led away from attribu- 
ting positive qualities to incentives probably arose historically through 
the turn toward objectivity and away from conceptions that appeared 
tainted with animism or teleology. 

There is little doubt that the drive-reduction theory of motivation 
led to a predominantly negative role for incentives. According to this 
interpretation, food is eaten not because it tastes good but because it 
ends hunger-pangs. A compartment in which the rat is not shocked is 
reinforcing not because of anything attractive about its size, color, 
temperature, or fittings, but because it provides relief from a compart- 
ment in which shock has been given or is threatened. This negative 
position moved into the interpretation of human motivation as well. 
“At the level of ego-psychology,” writes Mowrer (1952), “there may be 
said to be only one master motive: anxiety” (p. 423). Not something 
positive, like love, or avarice, or ambition, to spur one on — just anxiety 
from which we seek to escape. 

A rather one-sided position of this kind often gains wide currency 
because of controversies that develop among those who hold the position , 
so that this point of agreement is not questioned. For example, this issue 
does not arise between Guthrie's contiguity theory and Hull’s drive-reduc- 
tion theory. Guthrie, too, was negative about the role of rewards, since 
all they did was change the situation. Psychoanalysis, which might be 
thought of as a somewhat competing position, tended also to describe 
adjustment negatively, adjustment depending upon anxiety reduction or 
escape from feelings of guilt. Even Lewin’s theory, which recognized the 
positive valences of some incentives, looked to tension reduction as the 
basis for motivated action. Support was given also through Cannon’s 
doctrine of homeostasis, which made it easy to view the function of 
motivation as the restoration of equilibrium through reduced tension. If 
all these factions accepted some form of tension reduction or stimulus 
reduction or mere change toward equilibrium as the basis for understand- 
ing motivated action, it is not surprising that the role of incentives re- 
mained essentially negative. Concessions are always made to reality, so 
that it can be pointed out that there were some positive features assigned 
incentives in each of these positions. That is not the point: the Zeitgeist 
favored our seeing incentives not as providing something sought after 
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for what was inherent in the incentive, but something providing relief. 
The incentive was seen as an avenue of escape from pain, anxiety, tension. 

The drive-reduction theory or tension-reduction theory, predomi- 
nant in mid-century, has been coming under increasing attack and re- 
examination by its friends as well as by its enemies. 

1. There have, of course, been critics of the drive-reduction theory 
all along. Because of its inviting nature and its acceptance by otherwise 
conflicting factions among the learning theorists, most of those who ob- 
jected were “voices crying in the wilderness.” They failed to get much of 
a hearing. McDougall (1923) argued that striving toward a goal was 
satisfying, and reaching it was relatively unimportant; Allport (1937) 

( 1946 ) never did like the law of effect or the motivational theory related 
to it; Murphy (1947) wanted sensory and activity drives added to vis- 
ceral ones; Anderson ( 1941 ) wrote on the extemalization of drive; Young 
(1949) felt that we paid too little attention to appetites in our over- 
attention to drives. These views did not do very much to stem the tide 
of the prevailing theory, though they may have sown dormant seeds of 
doubt that have more recently sprouted. 

2. A rather important feature of the background for the present ques- 
tioning of the drive-reduction theory has come from within the camp 
of reinforcement theorists. The entering wedge came through experiments 
on secondary reinforcement, as when Wolfe’s (1936) chimpanzees ac- 
cepted poker-chips as rewards instead of food. When Hull in 1943 did 
not require drive reduction in order for a secondary reinforcing agent 
to strengthen habit, he opened a way for recognizing the positive roles 
of incentives, though he was reluctant to follow this path. Instead, by 
1952 he had made secondary reinforcers also stimulus-intensity re- 
ducers. He later introduced (K), incentive motivation, as a further move 
in the same direction, although K applied to primary incentives only. 

Spence’s incentive motivation, related to Hull’s K, while supposedly 
separable from the general drive level (D), is additive to D, and hence 
functions exactly as a component of drive. Spence’s theory gives recog- 
nition to a somewhat neglected role of incentives : their drive-enhancing 
role as well as their drive-reducing role. At least for perceived incentives, 
naturalistic observation tells us that they can enhance drive, as when a 
male dog is aroused by a receptive female in the neighborhood. Pre- 
viously experienced incentives, as in Spence’s theory, act similarly through 
the arousal of fractional anticipatory responses, even though the in- 
centive is not directly perceived. The incentive, in its role of enhancing 
drive, takes on qualities not suggested in its role as a drive reducer. 
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The drive-reduction theory is naturally strained if the same object 
can both enhance drive and reduce it. The paradox is partially resolved, 
however, if we recognize that more than one response is possible to the 
same object. Thus the smell of fresh bread may increase appetite, while 
eating the bread may reduce it. In the case of secondary rewards the 
problem is more difficult, for they are obviously not need-reducing, so 
their drive-enhancing and their drive-reducing qualities are harder to 
distinguish. Miller (1951a), struggling with this problem, suggested that 
perhaps the two effects alternated in some manner, Mowrer (1947), fac- 
ing the same problem, found it necessary to give up the drive-reduction 
interpretation as a unitary one, and go to his two-factor theory. 

We have already seen how the experimental evidence makes con- 
fident assertions about the need-drive-incentive pattern less possible now 
than a few years ago. With this weakening of the orthodox position, the 
way is opened for a hearing from those who offer new interpretations. 

3. Some psychologists with strong physiological orientation have 
turned their backs on the orthodox drive theory of motivated behavior. 

Hebb (1949) favors an interpretation of hunger as an organized 
neural activity in the cortico-diencephalic system. It can be aroused ( 1 ) 
by the sight and smell of food, and the act of eating, (2) by stomach 
contractions, and (3) conceptually. Because these several determiners 
may summate, hunger can be aroused or increased by an attractive ap- 
pearance of food, or by its smell and taste. He thus emphasizes the 
drive-enhancing role of incentives beyond their drive-reducing role. By 
emphasizing a "central motive state” (Morgan, 1943), the precise physi- 
ological condition of need becomes less significant and perceived incen- 
tives become more important. 

Harlow (1953) (1954) has come to emphasize curiosity as a mo- 
tive, on the basis of observations on monkeys in his laboratory. He finds 
that when he puts complicated devices in their cages they learn to 
take them apart, even though no “bait” (no obvious reinforcement) is 
used. They show a great deal of curiosity and will work hard merely 
for an opportunity to “look.” 33 

Reflections upon these experiments have led Harlow to make strong 
statements against drive reduction and physiological states as the basis 
for motivation; 

There are logical reasons why a drive-reduction theory of learning, a theory 
which emphasizes the role of internal, physiological-state motivation, is entirely 
untenable as a motivational theory of learning. 7 

ss Butler (1953); Harlow and McCleara (1954). 
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The condition of strong drive is inimical to all but very limited aspects of 
learning — the learning the ways to reduce tension. . . The hungry child is a 
most uncurious child, but after he has eaten and become thoroughly sated, his 
curiosity and all the learned responses associated with his curiosity take place. 34 

He interprets the evidence with monkeys to indicate that strong 
drive is quite as likely to inhibit learning as to facilitate it These criti- 
cisms, coming from a physiologically-oriented psychologist, represent one 
of the extreme reactions to the position that shortly before was being so 
widely endorsed. 

It is possible that Harlow has gone too far. Events are complicated, 
and possibly some drive-incentive conditions are appropriately treated 
according to the drive-reduction pattern while others may not be. Fur- 
thermore, boredom may arouse a tension state that is relieved by any 
change in the environment, so that satisfying curiosity may be boredom- 
relieving, and thus drive-reducing. 35 

Montgomery has performed a number of experiments on the ex- 
ploratory drive in rats, a drive closely related to curiosity. While explora- 
tory drive interacts with other drives such as hunger, thirst, and fear, it 
cannot be explained by them. Novel stimuli, for example, may arouse 
both fear and exploration, but after the prolonged exposure has reduced 
fear the exploratory drive may persist. Montgomery’s hypothesis is sim- 
ply that novel stimuli arouse the exploratory drive that motivates ex- 
ploratory behavior. 36 

A position like Harlow’s and the related one of Montgomery’s re- 
ceives some support from Woodworth (1947), who recognized that a 
motive to perceive was a prevalent one, and from Nissen (1954), who 
accepts a <e biogenic drive to explore, to perceive, to know.” Another way 
that Nissen puts it is that capacity is its own motivation; an organism 
goes about functioning in the ways it is prepared to function. 37 

Still another support from the camp of physiological psychologists 
for a more positive interpretation of what happens at the goal comes 
from the experiments of Olds and Milner (1954) on positive reinforce- 
ment through electrical stimulation of part of the rat’s brain, the septal 
area below the corpus callosum and between the two lateral ventricles. 

34 Harlow (1953), page 25. 

35 This is, as might be expected. Miller's reply to Harlow, e.g., Myers and Miller 
( 1954 ) . See also Brown ( 1953 ) . 

36 Montgomery (1952) (1953a) (1953b) (1953c), Montgomery and Monkman 
(1955). 

37 The amazing disturbances in human subjects restricted experimentally to an un- 
stimulating environment shows something of this need for interplay with the environ- 
ment, Bexton, Heron, and Scott (1954). 
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Electrodes permanently imbedded could be made to deliver an alternat- 
ing current to this area if the rat pressed the lever of a Skinner box. The 
“shock” was found to be positively reinforcing. That is, the rat would 
press the lever as if being rewarded, provided the current was turned 
on. When the current was turned off, and pressing the lever had no ef- 
fect, typical extinction ensued. (Other areas of the brain can be found 
mildly rewarding, some neutral, some punishing. ) 38 

The experiment demonstrates that it is possible to reward an animal 
without satisfying a physiological need or withdrawing a noxious stimu- 
lus. Thus reward appears to be separable from need and drive. Further 
work along these lines may prove helpful in determining just how re- 
ward and punishment work. 39 At this point the significance of the find- 
ing is that incentives may have properties requiring their definition 
independently of need and drive. It is possible that the role of Olds’s elec- 
trical stimulation was that of affective arousal, that the stimulation 
produced a state that the rat found "pleasant.” If so, the study gives some 
physiological support for the theory next to be considered. 

4. A clear statement of the positive role of incentives, and a re- 
interpretation of the motivational sequence, has been given by Mc- 
Clelland and his collaborators. 40 It is a theory of affective arousal, a 
pleasure-pain theory, thus returning, in some sense, to a "law of affect” 
rather than a ""law of effect.” It cannot be dismissed simply as old- 
fashioned hedonism, for there is a genuine effort to deal with the problem 
of objectifying affective arousal. A motive is defined as ""the redintegra- 
tion of a cue by the change in an affective situation.” 41 A less cryptic 
definition, published earlier, defined a motive as ""a strong affective as- 
sociation, characterized by an anticipatory goal reaction and based on 
past association of certain cues with pleasure or pain.” 42 

McClelland’s criticism of drive-reduction theories of motivation is 
of what we have called above their negative character, that is, their fail- 
ure to take account of active pleasures and comforts as well as the relief 
of pain and discomfort. If we seek thrills, excitement, adventure, estheti- 
cally satisfying experiences, because of the quality of experience they 
invoke, McClelland believes we do well to reintroduce the affective as- 

38 Delgado, Roberts, and Miller (1954) have confirmed with cats learning through 
electrical stimulation of parts of the tectal area, thalamus, and hippocampus, but the 
mechanism appeared to be the arousal of fear ( or pain ) . 

32 Olds (1955) has examined some of these possibilities. 

40 McClelland (1951), McClelland, Atkinson, Clark, and Lowell (1953), Atkinson 
(1954). 

41 McClelland and others ( 1953), page 28. 

42 McClelland (1951), page 466. 
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pects of motivation. That is, he returns to the common-sense notion that 
we seek activities that are pleasurable because we enjoy the pleasurable 
state that they excite in us, and we dislike and avoid other activities 
precisely because they are annoying or unpleasant to experience. Hav- 
ing taken this general position, he is not willing merely to turn back the 
hands of the clock to an earlier hedonism. Instead he is attempting to 
build a new theory on scientific props of the same order of objectivity 
and validity as those supporting the drive-reduction theory. One of the 
facets of his theory is that slight changes in intensity of stimulation tend 
to arouse pleasant affect, extreme changes unpleasant affect. 

The main point is that McClelland has substituted affective arousal 
for drive reduction, and all motives are said to be cued by changes in 
affective situations. Furthermore, all motives are learned. The two mas- 
ter motives recognized by McClelland are appetite and anxiety. Con- 
tinuity with the prevailing theories is there, so far as anxiety is concerned, 
but added to it are the positive motives summarized under appetite. 

However the many suggestions and counter-suggestions will finally 
come out, we may be sure that the need-drive-incentive pattern will not 
emerge in uncomplicated form. Some satisfactory solution will have to 
be found for the treatment of positive wants and desires that impel men 
. to heroic and daring action. Only a few men see in the passive Nirvana 
state the true bliss for which they long. 

Summary of Reinforcement and Drive 

The confident espousal of a need-drive-incentive conception of mo- 
tivated behavior, and of a drive-reduction theory of reinforcement has 
given way to a searching analysis of the circumstances under which learn- 
ing occurs, whether through contiguous stimulation, drive reduction, 
stimulus-intensity reduction, some combination of these, or through some 
process as yet not understood. Secondary motivation and secondary rein- 
forcement studies are very important in these analyses. 

The most significant new direction in the study of motivation is away 
from tension-reduction interpretations to an emphasis upon the positive 
aspects of behavior related to preferred incentives or activities. 

SUPPLEMENTARY READINGS 

A list of supplementary readings on current developments is provided fol- 
lowing Chapter 13. 



Chapter 13 

CURRENT DEVELOPMENTS: 
II. SOME PERSISTING ISSUES 


rpHE issues around the need-drive-incentive pattern, as discussed in the 
-L preceding chapter, are all central to stimulus-response theories. The 
quarrels between stimulus-response theories and cognitive theories are 
somewhat more clearly represented in other issues. One of these, which 
experimenters have actively investigated for twenty years, has to do 
with the nature of discrimination learning in animals. This issue has been 
selected for review because it is not only central, but currently alive. 

Stimulus-response psychologists have often shown impatience with 
cognitive theorists, charging them with lack of precision in the statement 
of their theories. We shall therefore consider some recent attempts to make 
up the deficiency in formal statement of cognitive theories. 

Finally, there has been a renewal of interest in the old problem of the 
contribution of neurophysiology to learning theory. This is the third of 
the current developments selected for review. 

DISCRIMINATION LEARNING IN ANIMALS 

Although discrimination learning in animals had long been the sub- 
ject of laboratory study, new interest in the problem developed after 
Kohler (1918) (1925) called attention to the theoretical importance of 
relational discrimination and transposition, and another dimension of 
interest was added as Krech began to write about “hypotheses” in the 
presolution period in discrimination learning by rats. 43 

The theoretical issues between these interpretations and those of 
stimulus-response psychology were sharpened by Spence in two impor- 
tant papers which have led to a considerable literature of experiment 
and theory. 

The first of these 44 took up the problem raised by Krech, and showed 

43 Krechevsky (1932a) (1932b). 

44 Spence (1936). 

434 
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how reinforcement and extinction could account for the non-random 
features of presolution learning. This paper led to distinction between 
two viewpoints that have come to be known as the continuity and non- 
continuity theories of discrimination learning. Continuity theory accounts 
for the present in terms of past associative learning. Non-continuity 
theory predicts sharp breaks in learning as the animal learner shifts from 
one systematic mode of behavior to another, or achieves insight. The 
continuity theory may predict sharp breaks also, under specified condi- 
tions, but they are always predictable on the basis of the prior experi- 
ences of the learner. Spence was on the side of continuity, Krech on the 
side of non-continuity. 

The second of Spences theoretical papers 45 was concerned with the 
issue of discrimination based on the absolute versus the relative proper- 
ties of the discriminated stimuli, as brought to the fore by Kohlers trans- 
position experiments. Spence showed very ingeniously how the overlap- 
ping gradients of generalization of excitatory and inhibitory tendencies 
resulting from reinforcing responses to the positive member of the stimu- 
lus pair and extinguishing responses to the negative member could ac- 
count for transposition with similar stimuli. 

The nature of Spence’s analysis is illustrated by Figure 64. The tests 
for transposition are made after the gradients in the figure have been 
set up through reinforcing responses to 160 (the positive stimulus) and 
extinguishing responses to 100 (the negative stimulus). Transposition 
can be tested by selecting any pair of neighboring stimuli. If response is 
then to the larger of the two stimuli, transposition is demonstrated 
because the original training required response to the larger of the stimuli. 
Suppose we select the pair of stimuli 160 and 256. We learn from the 
diagram that the response tendency to 160 has a strength of 51.7 and to 
256 a strength of 72.1. Hence the tendency to respond to the larger stimu- 
lus (256) is greater than the tendency to respond to the smaller stimulus 
(160), even though the smaller (160) was the positive stimulus during 
training. Thus the fact of transposition is accounted for without any as- 
sumption of reaction to the relationship “larger-than.” 

Spence’s theory has the virtue over Kohler’s that it can also predict 
the known failure of transposition with extreme stimuli. Suppose stimuli 
409 and 655 are selected as test stimuli. Transposition (reaction to 
the larger) would require response to 655. But we learn from Figure 64 
that the tendency to respond to 655 is 18.5, while the tendency to respond 
to 409 is 52.1. Thus transposition will break down, and response will 
be to the smaller of the two stimuli. 

45 Spence (1937). 
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Because Spence’s gradients were generated entirely on the basis of 
the absolute (specific) characteristics of the individual stimuli, he set 
himself against theories of discrimination based on relative characteris- 
tics. 46 

These two problems (continuity versus discontinuity; absolute versus 
relative discrimination) are closely related, but they are sufficiently 
separable for us to consider independently the status of the issues they 
represent. 



STIMULUS SIZE * SQ.CMS. 

FIGURE 64. Spence's theory of transposition (Spence, 1942). The visual stimuli 
to be discriminated in either training or testing are scaled along the baseline accord- 
ing to their areas. 

Reinforcement of responses to stimulus 160 during training produces the gener- 
alization curve represented by the solid line. That is, the tendency to respond is 
increased not only for responses to stimulus 160 but to neighboring stimuli in pro- 
portion to their proximity to 160. 

Non-reinforcement (extinction) of responses to stimulus 100 produces the corre- 
sponding inhibitory gradient represented by the broken line. 

The tendency to respond positively to any stimulus along the baseline can be 
computed by subtracting the length of the dotted vertical line from that of the solid 
vertical line at that position along the baseline. The results of this computation are 
indicated by the numerical values above each stimulus size. 

The text explains how the diagram is used to explain transposition. 

Current Status of the Continuity and Non-continuity Issue 

The original paper of Spence (1936), denying the discontinuity be- 
tween presolution performance and solution learning, has been followed 
by many experimental pro’s and con’s, of which only the more recent 
will be reviewed here. 47 

Ehrenfreund (1948) conducted two experiments, the data of the 
first appearing to support the non-continuity theory, while the second 
supported the continuity theory. In the first experiment, rats were ulti- 

46 He later granted that some circumstances might force discrimination on the 
basis of relative characteristics (Spence, 1952c), but only if the relationships could 
not be learned more readily on the basis of the absolute characteristics of the com- 
ponent patterns. In this connection, see Nissen ( 1953 ) . 

47 A few of the more important earlier references, which cite the other literature, are: 
Kreehevsky (1938), McCulloch (1939), Lashley (1942), Spence (1940) (1945). 
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mately required to learn to discriminate between an upright and an in- 
verted triangle, having first been given 40 trials in which responses to 
the (later) negative stimulus were rewarded. They learned the ultimate 
problem as rapidly as a control group whose prior practice was not con- 
tradictory. Hence there appeared to be some discontinuity between the 
presolution learning and the ultimate discrimination. In the second ex- 
periment, however, the experimental group was retarded in its later 
learning by its earlier contradictory practice. The apparent paradox was 
resolved by Ehrenfreund by pointing out that the arrangements of the 
first experiment precluded discriminably different retinal excitations from 
the later-to-be-discriminated stimulus cards during the presolution period. 
The later-to-be-discriminated stimuli appeared on the upper portion of 
the cards, and the animals during the presolution period were forming 
position habits without “paying attention” to the figures. The second 
experiment, by contrast, required attention to the cards, because the jump- 
ing stand was raised and the rat jumped against the later-to-be- 
discriminated figures during the presolution period. Under the circum- 
stances of the second experiment the reinforcements during the presolu- 
tion period modified the later discrimination learning, as continuity 
theory would predict. Ehrenfreund concluded that his findings as a whole 
supported the continuity theory. 

The implied importance of “attention to the stimuli” has been given 
further study by Lawrence (1949) (1950), who makes a distinction be- 
tween “relevant” and “irrelevant” stimuli, and who believes that his results 
mediate some of the differences between continuity and non-continuity 
theories. A relevant cue is one consistently correlated with reward or 
punishment, whether alone or in combination with other cues. Thus 
Ehrenfreunds cues in the presolution period were relevant ones because, 
for the experimental group, only one of the pair of figures was ever re- 
inforced. An irrelevant cue is one that is presented inconsistently, some- 
times with reward and sometimes with punishment. The stimulus cards 
during presolution were irrelevant cues in Ehrenfreund’s control groups. 
Lawrence found that when an animal learns to discriminate between a 
pair of cues in one situation there is transfer to a new situation, even 
though entirely different instrumental responses are required. The cues 
acquire distinctiveness, so that new responses to them can be learned 
more readily. 48 These new responses include reversals of habit, so that 
an oversimplified continuity theory is not applicable. Lawrence’s inter- 

48 Lawrence has really restated and given new evidence for Robinson’s law of 
acquaintance (Robinson, 1932a, page 118). 
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pretation is in stimulus-response terms, but implies a mediating response 
learned to the cues. This mediating response makes the cues distinctive, 
so that a new association is more readily established between them and 
instrumental responses. Lawrence and Mason (1955) have reported a 
somewhat elaborately designed experiment involving three sets of cue- 
dimensions (black-white, high and low hurdles, and spatial position). 
They showed that with two cue-dimensions present, one may remain non- 
functional while the animal is responding to the other; when the first has 
become relevant again, the discrimination is picked up as though the 
reinforcement and non-reinforcement of these cues did not matter while 
the other cues were relevant. When three dimensions were involved, 
there was a tendency to fall into systematic position preferences when 
the cue relationships to reward were changed. The authors suggest that 
the greater the number of irrelevant stimulus aspects during habit re- 
versal, the greater the likelihood of systematic runs to dimensions that 
do not lead to solution. 

As so often happens after the heat of controversy cools, the at- 
mosphere of debate is changed to one of search for the conditions under 
which various experimental results are found. 49 It often turns out that 
the original issue was a somewhat artificial one. The suggestion that cues 
may acquire distinctiveness through prior experience supports some as- 
pects of continuity theory, but also moves the theory somewhat c]oser 
to the claims of the non-continuity theorists. 

Another suggestion, bearing on the continuity and non-continuity 
problem, has been provided by the experiments of Harlow on “learning- 
how-to-learn” or what he calls “learning sets.” 50 He finds in experiments 
with monkeys that, after practice, discrimination reversals can be made 
following a single trial. Because this “insightful” performance is achieved 
gradually, Harlow believes he has demonstrated a “continuity” relation- 
ship between insight and prior experience. He writes: 

The field theorists, unlike the Neo-behaviorists, have stressed insight and 
hypothesis in their description of learning. The impression these theorists give is 
that these phenomena are properties of the innate organization of the individual. 
If such phenomena appear independently of a gradual learning history, we have 
not found them in the primate order. 51 

While this statement of Harlow s somewhat distorts the insight prob- 
lem, which has to do with how experience is utilized rather than with 
the non-necessity of experience, his experiments are useful additions to 

48 See, for example, Blum and Blum ( 1949 ) . 

^ 50 E.g., Harlow ( 1949 ) . See also Bateson ( 1942 ) on deutero-learning, as a “learning 

81 Harlow (1949), page 65* 
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the literature on continuity and the modifications brought about by 
prolonged experiences with related problems. 

As an illustration of the trend to treat the discrimination problem 
on the basis of experimental variation, and at a relatively non-controversial 
level, Spence’s mathematical study of the effects of drive level and de- 
layed reinforcement on several varieties of discrimination learning pro- 
vides a good case. 52 

He is able to show by a manipulation of his intervening variables 
that sometimes drive level will be important in discrimination, some- 
times not. For example, if, by some kind of forcing, responses to the posi- 
tive and negative stimuli are kept equal, discrimination should be inde- 


pendent of drive level: 

E + =Hx(D + K) (1) 

E_= (HxD) - I (2) 

E + — E__ = KH + I, the D’s cancelling (3) 

In free responding, habit strength to the positive stimulus will gradu- 
ally exceed that to the negative stimulus. Then we have: 

E + =H + x (D + K) (4) 

E__ = (H_ X D) — I (5) 

E + — E_ = D(H + — H_) -f (H+K + I) (6) 


According to Equation (6), the influence of drive upon the dis- 
crimination will depend upon the stage of practice. The relationship can 
become quite complicated. Spence points out how important it is to 
specify the initial and boundary conditions, as well as the laws and 
hypotheses of a theory, when that theory is applied to a particular set 
of experimental relationships. 

Spence (1955) has been very successful in applying his quantitative 
approach to a variety of discrimination data. The analysis of the two- 
bar problem, when responses to one bar are reinforced twice as often 
as the other, serves as a convenient illustration. The apparatus used in 
the experiment was similar to that used by Logan (1952), and the data 
were unpublished from Ramond (1954). This discrimination problem 
does not involve non-reinforcement (extinction), so that it is conveniently 
studied according to D and K, without the complication of I (inhibi- 
tion). What happens is that, in free-choice trials, the bar whose re- 
sponses have been more frequently reinforced (hereafter called the more 
frequently reinforced bar) is chosen more often than the bar whose re- 

52 Spence (1955), lecture 7. See also Spence (1954a). 
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sponses have been less frequently reinforced, up to a maximum discrimi- 
nation. But as the training continues, reinforcements on the favored bar 
no longer add the increment that reinforcements on the less frequently 
reinforced bar do, so the responses to the second bar ‘catch up,” and the 
discrimination becomes poorer. Responses to the more frequently rein- 
forced bar under two levels of drive are given in Figure 65. Note that 
under both levels of drive the maximum is at the same stage of train- 
ing. This agrees with the theory, in that the maximum at any one drive 
level is determined by the differences in growth rates of H to the two 
bars, which is a function solely of the number of reinforcements. 



0 123456789 10 

BLOCKS OF FOUR FREE TRIALS 

FIGURE 65. Responses to the more frequently reinforced of two bars under two 
levels of drive. Spence (1955), Lecture 7. 

In the approach to their respective maximum values, the responses 
to the more frequently reinforced bar proceed at somewhat different 
rates under the two levels of drive, so there is a stage at which the dif- 
ference between the two response levels is a maximum. This falls in the 
fifth block of free trials of Figure 65, and represents the point at which 
there have been 36 reinforcements of the more frequently reinforced bar. 
Spence designates the maximum value (here 36) as N A . 

Now to return to the quantitative theory. Spence has computed the 
point at which N A should occur according to the theory, and it turns 
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out for this problem (where one bar is reinforced twice as often, as the 
other) to be a function solely of i , the coefficient of the exponent used in 
plotting the acquisition curve: 

N* = (1) 

l 

Because N A has been determined empirically from the data (N A = 
36), it is possible to compute i, which turns out to be .0167. Hypothetical 
growth curves, using this exponent, can be drawn for responses to the 
more frequently reinforced bar under the two conditions of drive, as 
shown in Figure 66, using the value of i determined from N A . 



FIGURE 66. Reaction potential to the more frequently reinforced bar under two 
conditions of drive (theoretical, but based on Na. = 36). After Spence (1955), Lec- 
ture 7. 

The theory can now be put to a severe test. Will this i, determined 
from these data, fit latency data obtained in the forced trials to the more 
frequently reinforced bar presented alone? Latency is first converted to 
running speed. Then using the relationship between E and running 
speed, as previously determined theoretically (Spence, 1954b), and plot- 
ting Ramond’s data accordingly, an acquisition curve with the exponen- 
tial coefficient i — .0167 turns out to fit satisfactorily. This then meets 
the desire for stable interchangeable constants pointed out in the chapter 
on mathematical models. 

This mathematical extension of Spence’s discrimination theory re- 
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mains a continuity theory, in that separate equations are written for re- 
sponses to the positive and negative stimuli, and no discontinuous emer- 
gent is anywhere implied. 

Current Status of the Issue Whether Discrimination Is 

Based on Absolute or Relative Properties of Stimuli 

Spence’s second paper (1937) contained the careful description of 
intersecting gradients of generalization and was therefore expressed in a 
form well subject to testing. Again a substantial literature collected, of 
which only the most recent papers will be reviewed here. 53 

Grice (1949) showed that rats could learn to discriminate equally 
well between two stimuli whether they were presented successively or 
simultaneously. It is evident that the successive presentation favors re- 
action to the absolute values of the stimulus rather than to relative values, 
so that he interpreted his experiment as showing that there had been 
no reaction to relationship in the simultaneous presentation. Baker and 
Lawrence (1951), building their experiment upon his, got comparable 
results, except that they carried the experiment a step further. Testing 
for transposition with novel stimuli not used in training, they found that 
the rats that had learned the simultaneous problem transposed better 
than the ones that had learned the successive problem. Thus there was 
some evidence for relational learning after all, when an appropriate test 
was used to discover it. 54 

Gonzalez, Gentry, and Bitterman (1954) studied the relational dis- 
crimination of intermediate size in the chimpanzee. Spence (1942) had 
earlier deduced the discrimination of intermediate size from his theory 
and had demonstrated successful discrimination, where Lashley had 
previously failed. Gonzalez, Gentry, and Bitterman went on to demon- 
strate transposition, with successful results which they say "point un- 
mistakably to the operation of a relational process in the discrimination 
of intermediate size by the chimpanzee.” 

* Lawrence and De Rivera (1954) developed a new type of transposi- 
tion experiment likely to prove fruitful in further studies of relational 

58 For some of the follow-ups of this paper see Hull (1939a), Tolman (1939) 
(1941), Spence (1942), Kuerrne (1946), Lashley and Wade (1946), Hull (1947b) 
(1950b). 

54 Unfortunately the two experiments were not exactly equivalent, and so the con- 
clusion remains somewhat in doubt. The simultaneous-successive type of discrimina- 
tion, in a different form, has been studied by Weise and Bitterman (1951 ), Bitterman 
and Wodinsky (1953), and others in their laboratory. A flurry of papers related to 
these studies appeared in the Psychological Review following a paper by Nissen 
( 1950 ) , in which he argued that approach-avoidance was the fundamental dimension 
according to which discrimination should be described. The papers are worth reading 
for the experiments described, but the points at issue were not very well clarified. 
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learning. Because of its novelty and potential fruitfulness it is worth de- 
scribing in some detail. In this experiment the animal learner always 
faced two identical cards exposed in the Lashley jumping apparatus. 
Each card was divided into an upper and lower half. The lower half 
during training always consisted of an intermediate gray (No. 4 in a 
scale from 1 to 7, with 1 being lightest). The upper half was either No. 

1, 2, or 3, lighter grays, or 5, 6, or 7, darker grays. The animal had to 
learn to jump to the right if the top half was lighter than the bottom half, 
and to the left if the top half was darker than the bottom half. Note that 
it would conceivably be possible to ignore the bottom half of the card. 
It was entirely indifferent, so far as reward and punishment were con- 
cerned. Furthermore, the top halves of the cards could have been 
learned according to their absolute brightness, for 1, 2, and 3 were al- 
ways reinforced when jumping was to the right, and 5, 6, and 7 when 
jumping was to the left. Thus the situation is not one that forces rela- 
tional learning, although it of course permits it. 

We may raise the question: When the rats solved this problem (as 
they did), was it on the basis of absolute or relative stimulus character- 
istics? The experimenters went on to a test with the grays in different 
combinations. Consider the simple expedient of placing the cards up- 
side down, with No. 4 gray always on top. Now with the lighter Nos. 1, 

2, or 3 on the bottom, the top card is darker , and so jumping should be 
to 'the left (if cues are relational) while jumping should be to the right 
if cues remain absolute. Correspondingly the darker Nos. 5, 6, 7, if on the 
bottom, make the top card relatively lighter , so relative response would 
lead to jumping to the right, while absolute cues would suggest jumping 
to the left. This particular form of transposition test yielded 74 per cent 
of the jumps relationally determined, well beyond the .01 level of sig- 
nificance. Some of the transposition tests lead to the same prediction by 
both the absolute and the relative theories. For example, suppose gray 
1 is placed above gray 3. Then the top is lighter, and both components 
have been associated with reward for jumping to the right. With such 
stimuli, 94 per cent of the jumps were in predicted direction. A third set 
of relationships, such as No. 2 over No. 5 and No. 6 over No. 3 makes pre- 
diction from the specific stimulus (absolute) theory ambiguous, for one 
stimulus component is associated with jumps to the right, one with 
jumps to the left. In such cases, instead of chance choices, 95 per cent 
were in keeping with the relational hypothesis. S5 

These recent experiments show that responses to relationships are 

55 This particular high-success level may also be accounted for on another basis. Be- 
cause in original learning onlv the top half of the cards was significant, the top half 
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clearly possible under circumstances in which specific stimulus cues are 
minimized , and relational cues emphasized. Again an old controversy 
dissolves in favor of a statement of the conditions under which one or 
another form of response occurs. Rats can react either to the absolute 
characteristics of specific stimuli or to stimuli in relation to each other. 
It is the circumstances that are important in determining which they will 
do. Spence (1952c) has accepted what is essentially this position. 

The mathematical treatment of discrimination does not require 
favoring either absolute or relational discrimination. While Spence’s 
theory (1937) was based on the absolute qualities of the stimuli, the 
mathematical treatment by Gulliksen and Wolfle (1938) was based on 
the relative qualities of the stimuli. 

The details of Spence’s theory of overlapping gradients have been 
subjected to recent criticism. Spence (1937) and Hull (1939a) (1950b) 
adopted gradients differing in curvature, Spence’s being concave down- 
ward and Hull’s concave upward. Lawrence (1952) studied the transfer 
of discrimination along a continuum. He found that learning was more 
efficient if training was first given on an easy discrimination and then 
shifted to the test discrimination than if all the training was given directly 
with the test stimuli. He found also that best results were obtained when 
the test discrimination was approached gradually. The results suggest 
that discrimination learning comes about not only through the genera- 
tion of habit strength through reinforcement of specific stimuli, but 
through the isolation of functionally relevant stimulus dimensions. 56 In 
a theoretical treatment of his results, Lawrence (1955) has shown dif- 
ficulties in the attempt to apply either Spence’s or Hull’s form of gradient. 
Although some form of supplement or modification of Spence’s theory 
may prove to be applicable, Lawrence was unable to provide a consistent 
set of statements about generalization gradients applicable to his own 
study as well as to other studies of discrimination learning. 

Several experimenters have followed their predecessors in extend- 
ing the methods originating in animal experiments to the study of dis- 
crimination and transposition in children. 57 These experiments help 
somewhat in mediating the problem of absolute versus relative discrimi- 

may have become “distinctive.” A top half prediction conforms to a relative predic- 
tion for these stimuli, so that one aspect of reaction to specific stimuli corresponds to 
reaction to relations, and hence a high per cent of successes might be anticipated. 

56 See in this connection the argument by Gibson and Gibson (1955) that in man 
perceptual learning comes about by differentiation of cues. 

57 For earlier studies, see, for example, Ivanov-Smolensky (1927), Jackson and 
Dominguez (1939). The later ones include Kuenne (1946), Alberts and Ehrenfreund 
(1951), Stevenson and Iseoe (1954), Stevenson, Iscoe, and McConnell (1955). 
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nation by showing that, in general, transposition is easier as the chil- 
dren become older. Some importance is attributed to the development of 
language, but the causal factors are obscure and alternative hypotheses 
are possible. One curious finding 58 is that a size-discrimination was much 
easier progressively from preschool years to the fifth grade, but then be- 
came more difficult up to the college years. The only plausible interpre- 
tation was that the older students were looking for a more difficult prob- 
lem and overlooked the very simple demands of the discrimination. 
This suggests (for human learning at least) theories emphasizing "set” 
or acquired cue distinctiveness over those based merely on the reward- 
ing and punishing of specific responses. 

These recent studies leave the problem of discrimination at a ma- 
turer level for analysis, beyond the stage of polemical debate in which one 
side or another scores a "victory.” There are systematic modes of solu- 
tion emphasized by the non-continuity exponents, and there are the 
cumulative results of experience emphasized by the continuity exponents. 
There are responses to specific stimulus components as advocated by 
those building their theories upon them, but there are also responses to 
stimulus characteristics defined in relational terms. 

THE FORMALIZATION OF EXPECTANCY THEORY 

Because of its more precise formal structure, stimulus-response theory 
has guided more research and theory construction in recent years than 
cognitive theory. Many of the problems set for stimulus-response psy- 
chologists, however, were posed by those from the other camp: for exam- 
ple, the distinction between learning and performance, latent learning, 
transposition, systematic modes of problem solution, secondary reinforce- 
ment (in the form of sign learning), incentive motivation (the valence 
of perceived incentives). As stimulus-response experimentation and 
theorizing has become more developed many of the challenging prob- 
lems which cognitive theorists first called attention to have taken on new 
interest for the stimulus-response theorists. For example, instead of deny- 
ing latent learning, new stimulus-response derivations are required to 
account for it; instead of denying relational discrimination, earlier theories 
are modified to include it. This is entirely wholesome, and in the end 
may lead to the kind of unification of psychological theory that we all 
wish to see. 

A few psychologists, many of them trained in stimulus-response ap- 
58 Stevenson, Iscoe, and McConnell (1955). 
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proaches, have become increasingly aware that the cognitive theorists 
were talking about psychological reality, even though their formalization 
of certain phenomena was inadequate. Some of them have turned their 
attention to the formalization of cognitive theories in a manner more 
nearly consonant with the kind of systematic treatment that stimulus- 
response theories have been getting. 

The fact that something may not yet have been done does not mean 
that it cannot be done. Miller (1949) has repeated the invitation of Hull 
(1930) to those friendly to gestalt psychology to do for cognitive theories 
what has been done for stimulus-response theory. Yet it must be remem- 
bered that little was done to formalize Thorndike’s theory between 1898 
and 1930. The motivations of psychologists have to be taken into account 
as well as the ripeness of their theories. Here lie some interesting prob- 
lems for the historian of science. 

Osgood (1950), acknowledging a preference for stimulus-response 
theories, tried his hand at applying Tolmans theory to avoidance learn- 
ing. He suggested four postulates consonant with Tolmans theory, and 
believed that they could be used satisfactorily to derive avoidance learn- 
ing. 

Olds (1954) set up a neural model for Tolmans theory, based on 
Hebbs (1949) discussion of cell assemblies. The model is sufficiently 
precise to allow for experimental testing, and one experimental study 
based on it has been reported (Olds, 1953). 

MacCorquodale and MeehTs Postulate System 

The most careful attempt to date to develop a Hull-like set of pos- 
tulates for a Tolman-like expectancy theory is that of MacCorquodale 
and Meehl (1953) (1954). Without arguing its truth-value they have 
taken the task of making explicit what expectancy theory would be like 
were it formalized. 

To begin with, an expectancy is said always to have three terms: 
(1) something initially perceived, called an elicitor of the expectancy 
(Sj); (2) something to be done following this perception, a response 
(RJ; and then something .that will be perceived, as the goal of the ex- 
pectancy, the expectandum (S 2 ). The process can be symbolized as 
S 1 -R 1 -S 2 in the defining of an expectancy just as the process S-R is 
used to define an association. The best way to remember the process is 
to think of some ordinary sentence, such as: "When this button (SJ 
is pressed (RJ, I expect to hear the ringing doorbell (S 2 ) ” This total 
process, prior to doing anything to the button right now, is the expectancy. 
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If ringing the doorbell becomes a goal (becomes valenced ), then the 
expectancy is activated. I push the button only if I want to ring the door- 
bell; I may have the expectancy (S 1 R 1 S 2 ) without doing anything about 
it 

Capital letters are used to describe the significant environmental 
events S^R^S.*, and the same letters may be used for the inferred con- 
struct, the expectancy (SJRjSJ. The letters are enclosed in parentheses 
when reference is to the construct rather than to events themselves. But 
whenever capital letters are used, whether in describing events or the 
inferred expectancy, it is appropriate to use physical-stimulus language. 
That is, S t and S 2 are objects in the environment, with their ordinary 
characteristics, and R x is a physical response. The specification of the 
terms of an expectancy in physical language permits not only objective 
verification, but also clear statements about confirmed and unconfirmed 
expectancies. (The logic of the physical language is no different from that 
used in assigning dimensions to Hull's s Hr. ) 59 

Twelve postulates are stated, often with guesses as to the form of 
mathematical function involved. For a briefer introduction to the flavor 
of the system, the postulates are here paraphrased, with much of the 
quantitative detail omitted. 

1. Mnemonization. The occurrence of the sequence S^R^S,, results in an 
increase in the strength of an expectancy (S^SJ . The rate of growth increases 
with the valence of S 2 . The limit of growth depends on the probability (P) with 
which S 2 regularly follows S 1 -R 1 . That is, growth approaches P as a limit. 

This is the basic acquisition postulate. It is a contiguity theory, 
in that mere occurrence of the sequences presumably leads to increase 
in expectancy. But it is also a kind of reinforcement theory, because the 
expectancy is influenced by the valence of the terminal item (S 2 ) in the 
sequence. It is not, however, a stimulus-response theory in the usual 
sense. MacCorquodale and Meehl point out that by their accepting speci- 
fied parameters for rate and limit of growth they have not made conces- 
sions to some form of non-expectancy theory. 

2. Extinction. The occurrence of a sequence S l -R l , if not followed by S 2 , 
tends to produce a decrement in the expectancy. 

Two cases have to be distinguished. If P has been 1.00 (that is, 
Sj-Rx always followed by S 2 ) a single omission of S 2 will lead to a 

59 Meehl and MacCorquodale allow for a central state called the expectant (sinsa), 
tut they make no use of it in their preliminary formalization. Presumably if one were 
to turn to neurophysiological explanations there would be some translations between 
the neurophysiological language of ( SiTiS 2 ) and the physical language of (S*RiSs). 
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prompt decrement, depending both on the current strength of (SJ^SJ 
and the valence of S 2 . If, however, P has been less than 1.00, the course 
of decrement will be slower, and, if P shifts to a lower P', the expectancy 
will approach the new value of P' as an asymptote. 60 

3. Primary-stimulus generalization. When an expectancy (S^SJ has been 
strengthened, other expectancies (S'^SJ will also have received some strength, 
depending upon the similarity of the elicitors S\ and S r 

Similarity gradients are an empirical matter, and there is no point in 
trying to distinguish the functions from those used by stimulus-response 
psychologists. Tolman (1939) recognized this when he built Spences 
(1937) gradients into his “schematic sowbug.” 

4. Inference. When an expectancy (S^SJ exists at some strength, the 
presence of a valenced object S* in close contiguity with S 2 gives rise to a new 
expectancy (SJ^S*). 

5. Generalized inference. Even though (S 1 R 1 S 2 ) is originally of zero 
strength, the occurrence of a valenced object S* in close contiguity with S 2 may 
give rise to a new expectancy (S 1 R 1 S # ), provided (S^S'J is of some strength, 
and S' 2 is similar to S 2 . 

These are the kinds of postulates said by MacCorquodale and Meehl 
to contribute heavily to the identification of an expectancy theory. They 
give the basis for latent learning and for the interpretation of experi- 
ments modeled after Maiers reasoning experiments. 

The next six postulates have to do with motivation, described ac- 
cording to needs, cathexes, and valences. I am taking the liberty of re- 
numbering them for expository purposes. They can easily be identified 
in the original source because their titles are unchanged. The three fol- 
lowing postulates state tire fundamental position with respect to motiva- 
tion. 61 


6. Cathexis. The cathexis (C*) of a valenced stimulus situation (S*) is a 
function of the number of contiguous occurrences between it and the consum- 
matory response. 

7. Need strength. The need (D) for a cathected stimulus (S°) is an in- 
creasing function of the time interval since satiation for it. 

8. Valence. The valence of a stimulus S* is a multiplicative function of the 
correlated need D and the cathexis C* attached to S*. 

60 We know from Humphreys’ (1939b) experiment that following a P of .50 there 
may be an increasing level of ( S 1 R 1 S 2 ) for a few trials if S 2 is regularly omitted. The 
MacCorquodale-Meehl postulate is incomplete, as they were the first to admit. Yet it 
is a few points, such as this increase in response in the first few trials of extinction, that 
provide some of the greatest support for an expectancy interpretation over other in- 
terpretations. 

61 The treatment covers rewarding (appetitive) motivation only. Pain and avoidance 
are not treated. 
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Postulates 6 to 8 imply a fairly complex theory of motivation, gen- 
erated ( as Spence’s recent theory was ) by the problems of latent learn- 
ing. The sight of food leads to food-seeking because of the correlation of 
sight of food with consummatory response (eating). This is the meaning 
of cathexis. Food thus gains a certain attractiveness ( cathexis ) through 
prior experiences of eating, but its present pulling power as an incentive 
(its valence) depends also on how hungry the organism is (its need 
level). Need is a function of deprivation; cathexis is a function of ex- 
perienced satisfactions in the past; valence is a product of both. There 
is a striking parallel with Hull and Spence’s D and K, which act together 
(as a product or as a sum) to influence reaction potential. 

Now we are ready for some postulates (actually earlier in the origi- 
nal list) concerning some further relationships between experience and 
cathexis. 

9. Secondary cathexis . The contiguity of S 2 and S* (a valenced stimulus) 
increases the cathexis of S 2 . 

This is a kind of “secondary reinforcement” or acquired incentive 
value, whereby the valence of one expectandum is given to a contiguous 
expectandum. 

10. Induced elicit or-cathexis. The acquisition of valence by an expectandum 
S 2 belonging to an existing expectancy (SjRjSg), induces a cathexis in the elici- 
tor S r 

11. Confirmed elicit or-cathexis. The confirmation o£ an expectancy (S 1 R 1 S 2 ), 
when S 2 has a positive valence, increases the cathexis of S r 

These two postulates mean that an elicitor, as a discriminative stimu- 
lus, comes to have reinforcing power. It is easily demonstrated that cues 
"at a choice-point, if similar to cues in the goal-box, facilitate learning. 
While usually interpreted according to secondary reinforcement, the no- 
tion of acquired elicitor cathexis is roughly equivalent. 62 

The difference between 4 induction” and “confirmation” is that be- 
tween introducing food after satiated runs, and learning while hungry. 

12. Activation. The reaction potential S E R of a response R x in the presence 
of S ± is a multiplicative function or the strength of the expectancy (S 1 R 1 S 2 ) and 
the valence of the expectandum. 

This final postulate gets around the jibe of Guthrie (1952, page 143) 
that Tolman leaves the rat buried in thought. The parallel with the Hull- 
Spence basic formulation is striking: 

62 Bauer and Lawrence (1953) show that the problem of choice-point cues and 
goal cues is complex. Conditioned inhibition may turn out to be as relevant as sec- 
ondary reinforcement. 
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Hull-Spence 

s E e = f( s H b ) X f(D, K) 

MacCorquodale-Meehl 

sE^fCSASJ X /(D, C*) 

As one goes on to specify in more detail the relations between re- 
action potential and response, there remain the problems of thresholds, 
oscillation, and the like. MacCorquodale and Meehl point out that there 
is no reason to handle these further matters differently in the expectancy 
model and the stimulus-response model. 

How successful are these postulates in deducing the experimental 
phenomena of learning? MacCorquodale and Meehl (1953) illustrate 
the procedure by deducing the phenomena from several experiments 
representing subvarieties of latent learning. 63 They succeed reasonably 
well in accounting for the data through manipulation of the postulates. 

The postulate set is avowedly weak on some matters that have been 
of particular interest to expectancy or cognitive theorists, such as the 
problems of multiple paths, systematic modes of solution (“hypotheses”), 
perceptual patterning. The authors believe that they have chosen a course 
more fruitful than that of Tolman himself in his emphasis on “maps” 
and “perceptions.” It is a useful beginning in showing, first, how a theory 
of this kind can be made explicit, including notational reference to re- 
sponse within an expectancy setting, and second, when made explicit, 
‘that there are large areas of agreement between a theory of this kind 
and stimulus-response theory. 

Rotters Social-Learning Theory 

Rotter (1954), in a book designed to provide a theory for clinical 
psychology, has evolved an expectancy-reinforcement theory with many 
points of resemblance to the MacCorquodale and Meehl model. 

His basic formula for behavior potential (B.P. ), the equivalent of 
reaction potential, makes it a function of expectancy of reinforcement 
(E), and the reinforcement value (R.V.) of the expected reinforcement 
That is: 

R.P. = /( E, R.V.) 

It will be recalled that in other systems the motivational term com- 
monly has two components, one reflecting need or drive, the other reflect- 
ing goal-activity or incentive value. Rotters theory concentrates upon 

68 The experiments are those of Kendler ( 1946), Meehl and MacCorquodale (1948), 
and Tolman and Gleitman ( 1949). 
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the incentive value, and the reinforcement value (R.V.) is measured 
directly by the preference for one goal-object over another when the 
possibilities of occurrence are equal. Something similar to Allport's func- 
tional autonomy of motives is implied: social goals are learned ones, and 
they need not be referred back to physiological needs. Instead, present 
reinforcements are best understood in relation to prior reinforcements: 

We find it a sufficient basis for prediction to state that behavior directed to- 
ward the attainment of a learned goal or external reinforcement may be predicted 
through a knowledge of the situation the organism is in and from a knowledge of 
his past learning experiences. It appears as if a theory that attempted to predict 
meaningful behavior through description of the organism's physiological drive 
condition at a particular moment would hold not only no advantages but many 
disadvantages. 64 

The theory has a rather different flavor from the familiar ones be- 
cause of its social, hence more molar, reference. For example, the list of 
six major needs brings newcomers to learning theory: recognition-status, 
protection-dependency, dominance, independence, love and affection, 
physical comfort. 65 

Another new concept is that of freedom of movement , which is re- 
lated to psychological need. It is defined as: The mean expectancy of 
obtaining positive satisfactions as a result of a set of related behaviors 
directed toward the accomplishment of a group of functionally-related 
reinforcements. 66 What this means is that high freedom of movement 
implies an expectancy of success for many different kinds of behavior in 
many different situations related to a family of common goals. Low free- 
dom of movement, of course, implies the opposite. 

Rotter’s theory has been subjected to a number of experimental 
tests in relation to its quantitative assumptions, but few of the results 
have yet reached publication, so that critical appraisal in relation to 
other learning theories must be postponed. 67 It is a significant effort at 
theory construction, however, and should be seriously considered because 
of its attempt to broaden the boundaries encompassed by our present 
quantitative theories. 

64 Rotter (1954), page 116. 

65 Rotter ( 1954), page 132. 

66 Rotter (1954), page 194. 

67 Some studies have appeared at the time of writing, e.g., Crandall (1951), Schroder 
and Rotter (1952), Rotter, Fitzgerald, and Joyce (1954). Other previously unpub- 
lished dissertations are appearing currently. 
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NEUROPHYSIOLOGICAL INTERPRETATIONS 
OF LEARNING 

Among the systematise whose views are presented in Chapters 2 
through 10, only Thorndike, Kohler, and Freud gave any appreciable at- 
tention to the neural correlates of their theories. Thorndike’s early “syn- 
aptic resistance” theory seemed no longer important to him in his later 
years. Freud’s influence has been functional rather than neuroanatomical. 
Only Kohler has continued to attempt to find physiological support for 
his principle of isomorphism. 

This relative lack of interest by learning theorists in the nervous 
system, particularly in the period of 1930 to 1950, is again a problem for 
the historian. No one denied the importance of the nervous system in 
learning. The denial had to do only with its importance for theories of 
learning. Possibly one influence was a certain disillusionment about 
progress in neuroanatomy and neurophysiology after the breakdown 
of the localization doctrine under many lines of evidence, including 
Head’s studies of aphasia and Lashley’s cortical ablation studies. If no- 
body really knew anything about how the brain worked, what help could 
present neurophysiology be to the learning theorist? So the argument 
ran. Perhaps another influence was the tendency for objective studies 
to favor “peripheral” rather than “central” interpretations. Guthrie and 
Hull were making the case for fractional anticipatory responses and 
movement-produced stimuli; Skinner was arguing that we stick to the 
data and avoid the c.n.s. (“conceptual nervous system”). When even a 
“centralist” such as Tolman had little interest in the nervous system, it 
was not surprising that the non-physiological position temporarily won 
die day. 

A reaction has now begun to be felt, and the relation of the nervous 
system to learning is awakening renewed interest. Although, to be sure, 
the interest had never fully died out, the renewal of widespread interest 
among the theorists can be dated by the appearance of Hebb’s (1949) 
neuropsychological theory. Hebb brought a freshness of approach, capi- 
talizing on recent developments in neurophysiology and introducing 
some daring speculations in a non-polemical manner that invited tractable 
consideration. Two specific events that marked the reaction away from 
the non-physiological period are some remarks in a discussion by Tolman 
(1949c) pleading for a more explicit neurological model, and a subse- 
quent insistence by Krech (1950) that we think of every hypothetical 
construct in psychology in purely neurological terms. 
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The Experimental Contributions of Neurophysiology to an 
Understanding of Learning 

A great many experiments have been devised through the years in 
an effort to locate the parts of the brain essential for carrying out learned 
activities. While results of interest have been secured, they have not led 
to any significant generalizations about the precise mechanisms of learn- 
ing. 68 

Although the new interest in neurophysiological theories of learn- 
ing also has thus far led to little that is of genuine explanatory value, a 
number of leads have been opened. Without attempting to appraise the 
vast literature, we may mention a few of the developments as indicative 
of some of the directions in which research is moving. 

1. Electrical exploration of the brain of human subjects under local 
anesthetics (especially with epileptic patients) has located some sites, 
particularly in the temporal lobe region, where vivid memories appear 
to be re-enacted under stimulation. 69 

2. The temporal lobes, particularly the amygdaloid nuclei, have be- 
come of special interest not only in studies of emotion, but also in the 
affective control of learning. Cats with these nuclei ablated required more 
trials to learn an avoidance response than cats with control lesions, but 
their retention was unaffected. 70 

3. Recent work, in part using the electroencephalograph, has called 
attention to a diffuse system by which the midbrain affects the cortex 
other than through anatomically defined sensory pathways. This diffuse 
system operates via the reticular substance, and permits incoming sensory 
stimulation, acting at a subcortical level, to waken the organism and 
incite it to activity. 71 

4. Working with amplifiers that can record resting potentials or 
slowly varying ones, it is possible to detect changes of potential on the 
surface of the skull at the occipital lobes corresponding in some ways to 
the patterning of visual stimuli. These results appear to give some sup- 
port for Kohler's field theory of excitation. 72 

5. Imbedded electrodes worn while animals engage in learning are 

68 Useful summaries of the major findings may be found in reviews by Marquis 
( 1951 ) , Morgan ( 1951 ) , and Osgood ( 1953 ) . 

69 Penfield and Jasper ( 1954 ) . 

70 Br1idy and others (1954). The problem was opened up by Kliiver and Bucy 
(1939). For a review of temporal lobe function see Milner (1954). 

71 E.g., Lindsley and others (1950), Magoun (1952). 

72 Kcmler and Held (1949), Kohler, Held, and O'Connell (1952). The theory sup- 
ported has been criticized experimentally by Lashley, Chow and Semmes (1951), 
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being used to study the effects on learning of stimulating various por- 
tions of the brain. 73 

6. Some chemical differences are found in the brain tissue of rats 
high in "hypothesis” behavior as contrasted with those low in "hypothesis” 
behavior. 74 

Interes tin g as these various leads are, they are mostly rather remote 
from the central problems of learning, although some of them bear on 
the problems of motivation and the nature of reinforcement. 

Hebb’s Speculative Neuropsychological Model 

Because of its influence in renewing interest in neurophysiological 
speculation, Hebb’s ( 1949 ) theory deserves special mention. 75 

Hebb recognizes that the basic problem is to understand thought , 
the something that intervenes between sensory and motor processes and 
is not fully controlled by environmental stimulation. He wishes to ex- 
plain thought in neurological terms and of course to do so without any 
shade of animism. He has no truck with a little man in the head who 
wants or disapproves or guides. Hence, while Hebb wishes a theory 
adequate to the phenomena of a cognitive psychology, he wants no soft- 
ness in it, and so ties it as closely as possible to what we know about 
neurones, their interconnections, and their functioning. The "central 
process” in the cortex must be better specified than in the telephone 
switchboard analogy of connectionists, or in the distribution of potentials 
across a homogeneous surface as implied by cortical-field theorists. 

Hebb’s two concepts most important for understanding cortical 
processes are the cell assembly and the phase sequence . 

A cell assembly arises through frequently repeated particular stimu- 
lation. It corresponds roughly to the persisting neural counterpart ("en- 
grain”) of a simple association. It is a diffuse structure comprising cells 
in the cortex and subcortical centers. When the particular stimulation 
again occurs, the cell assembly is aroused. Going into action as a closed 
system, it facilitates other systems and usually facilitates motor response. 
A cell assembly can thus be activated by another activated cell assembly, 
by sensory stimulation, or by both at once. 

The neuronic basis for a persisting reverberating circuit is specula- 
tive. Hebb suggests that stimulation repeated often enough to have a 

73 E,g., Olds and Milner (1954), Delgado, Roberts and Miller (1954). 

74 Krech and others ( 1954). 

75 No attempt is made here to review the many speculative models that have ap- 

peared, e.g., Ashby (1952), Cobum (1951) (1953), McReynolds (1950) (1953), 
Wolpe (1949) (1953). ' 
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lasting influence results in the growth of “synaptic knobs* that increase 
the area of contact between the axon and the tissue into which it fires. 
This comes only after repetition; the primary process creating the cell 
assembly depends on convergences of excitation in a rather diffuse 
neurone network, as a consequence of contiguous s tim ulation. Some 
equipotentiality is implied, because once a cell assembly is set up, 
fewer fibers have to be active at once in order to traverse the synapses 
necessary to arouse the cell assembly. 

A phase sequence is constituted by the arousal of a series of cell 
assemblies. For example, looking successively at the three comers of 
a triangle arouses the cell assemblies appropriate to each corner, and 
these facilitate each other. Thus the whole triangle may be perceived 
while attending to any one corner. Even the act of perceiving a triangle 
involves rather complex cortical facilitation, both sensory and motor; 
for example, we can anticipate (“expect”) the change that will come 
about by changing fixation from corner A to comer B. Thus the phase 
sequence is the thought process. 

At birth there is an intrinsic organization to cortical activity, indi- 
cated by the large slow waves of the infants electroencephalogram. 
For learning to take place, sensory processes have to interrupt this 
intrinsic organization and gain control of association-area activity. Hence 
the larger the association area (relative to the sensory areas), the more 
difficult it is for the young to learn. Hebb accounts on this basis for the 
slowness of learning by the human infant as compared with lower 
animals. 

The first stage of learning is the establishing of cell assemblies and 
their related phase sequences. At first this comes about slowly, and slow- 
increment learning is characteristic of childhood. But the results of such 
learning endure and are influential throughout life. 

Adult learning is essentially conceptual, often prompt and insight- 
ful. Such learning is possible, according to Hebb, only when the stimu- 
lation sets off well-organized phase sequences. 

This is the theory in broad outlines. Hebb’s book has been influential 
not only because of its neurophysiological ideas but because of the 
wealth of psychological observations reported in it. Unfortunately it is 
impossible to convey this richness of detail in a few paragraphs. 
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CONCLUDING REMARKS ON CURRENT DEVELOPMENTS 

The theoretical positions developed in the first half of the cen- 
tury have had their fruits not in the final triumph of any one system 
but in a more sophisticated approach to both experimentation and re- 
finement of theory. The vast volume of experimental work now appear- 
ing is almost all oriented to the checking of one or more hypotheses, 
and these hypotheses are imbedded in larger theoretical contexts. This 
kind of development tends at once to soften the clashes between sys- 
tematists as they attempt soberly to face common facts, and to improve 
fact-gathering by experimenters. The newer facts, oriented to more pre- 
cise theories, are more likely to be additive than they were in the past. 

We have reviewed the kinds of searching criticisms that are going 
on in current studies of reinforcement, of motivational aspects of learn- 
ing, and of the problems of learned discrimination. The efforts to sys- 
tematize “expectancy” interpretations come in recognition of the cogni- 
tive and directional aspects of learning. Nemo physiological speculation 
is again welcomed. These developments have not yet reached any cli- 
max, and no mediator has yet appeared to enlist agreement on some kind 
of harmonious interpretation of all the phenomena of learning/Yet prog- 
ress is being made, and there is reason to hope that we are meting toward 
the unification of theory. 
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Chapter 14 

RETROSPECT AND PROSPECT 


T HE array of theories we have examined may be thought to present 
an unfavorable picture of the state of systematic knowledge about 
learning. How can psychologists be helpful to other social scientists or 
to those who wish to apply their findings if they disagree among them- 
selves on these fundamental matters? It seems to us that there are several 
considerations which temper this adverse judgment. For one thing, there 
is a great deal of empirical knowledge about learning which is unrelated 
to the differences among the major points of view. For another, there are, 
in addition to the general theories, a number of special theories, more 
closely related to particular experimental situations. These special theories 
are tested and corrected in a matter-of-fact manner, cutting across the 
preferences of rival theoretical positions. Finally, discipleship in one or 
another of the major schools is not characteristic of most psychologists 
working in the field of learning. There are strong tendencies toward medi- 
ating positions. Many writers have found the major contributions of the 
different schools largely reconcilable . 1 

The disadvantages of conflicting points of view are balanced to some 
extent by the motivation which a strong position provides its proponent 
and by the challenge it presents to its adversary. Out of the heat of con- 
troversy there is eventually scientific advance . 2 Even in the midst of CQn- 
troversy, it is not necessary for the theorist to set aside reasonableness or 
critical powers. Theoretical differences show up most markedly where 
the data remain ambiguous, so that preferences hold sway over evidence. 
As the data become more securely established, it becomes increasingly 
possible to translate one system into another. 

While the state of knowledge is not therefore as bad as the parade 
of points of view makes it out to be, it is still rather unsatisfactory. There 
are no laws of learning which can be taught with confidence. Even 

1 E.g., Dashiell (1935), Kellogg (1938), McConnell (1942), 

2 Boring (1929). 
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the most obvious facts of improvement with practice and the regulation 
of learning under reward and punishment are matters of theoretical 
dispute. 


TOWARD A MORE SATISFACTORY THEORY 

A generally satisfactory theory awaits a set of concepts which will 
be appropriate to all that is known about learning. While each major 
theorist believes his concepts to be the appropriate ones, their perti- 
nence often derives from the fact that the theorist's interest is primarily 
in some segment or aspect of the total learning situation. Some are inter- 
ested in the initial adjustment, as in insight or problem-solving, others 
are interested in the predictability of performance under varied con- 
ditions of motivation, and so on. We need a more careful delineation of 
the kinds of learning which take place ( each of which may have “laws'' 
of its own), and an acceptable fractionation of the aspects of learning 
which make demands upon theory. Some theories are different largely 
because they are concerned with different problems to begin with. This 
search for the appropriate concepts is not merely an exercise in definition 
or classification. It requires a high order of theory construction, based 
on open-minded acceptance of demonstrated relationships, and in addi- 
tion contributing to the ordering of such relationships into a system. 

There is no single direction for theory to take. In fact, it is useful 
to recognize a division of labor in theory construction just as there is a 
division of labor in experimentation. Here are some of the needed direc- 
tions, and theory can be advanced by good work in any one of them: 

1 . More precise formulation and quantification . Ultimately science 
must be exact and quantitative. We have had occasion to review some of 
the newer model-building efforts in Chapter 11, where we cited both 
the advantages and limitations of this approach. 

2, Greater specification of mediating processes . The intervening- 
mediator type of theory has been forced into prominence by the factual 
relationships found within behavior. Simple correlations with end-terms 
do not suffice to account for the kinds of variation discovered in these 
relationships. 

Because this is so, two possibilities remain open. One is to stay with 
the functional analysis and to specify the intervening process solely in 
terms of the conditions that give rise to it. The mediating concept remains 
intangible, but it must be “triangulated" so as to be specified and meas- 
ured. It must have units of measurement that derive from the opera- 
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tions giving rise to it. MacCorquodale and Meehl (1948) named this 
kind of mediating process an * 'intervening variable.” 

The other possibility is to give the intervening process tangible 
or substantial status, and then search for it directly. MacCorquodale and 
Meehl call this kind of tangible intermediary a “hypothetical construct.” 
The experiments concerned with the need-reduction theory of reinforce- 
ment are of this sort; they are not studying the place of reinforcement 
in the equations of Hull, but they are going at the substance of a rein- 
forcement postulate. Similarly, the direct attacks on drive in relation to 
food deprivation can be launched without worrying about drive as a 
multiplier of habit strength. The fractional anticipatory goal response 
is substantive and ought to be directly observable in appropriate experi- 
ments. A neurophysiological theory, provided it leads to concrete ex- 
periments, can give a substantive basis for the mediating processes in 
either the peripheralist or centralist theories. 

3. Greater inclusiveness. Spence ( 1953 ) has given us a diagram show- 
ing six levels of lawful relationships within learning data, to which corres- 
pond six first-level theories, close to these varieties of data. Above these 
in a logical hierarchy are second-level theories that co-ordinate the first- 
level theories, and then there may be more general theories at the third 
level. It is his contention that we must work up from the fields in which 
the data are best quantified, and where the interrelationships of vari- 
ables have been most thoroughly explored. At present, our most precise 
theories deal with conditioning, selective learning, and serial learning. 
Our first-level theories are very weak in the study of skills, problem- 
solving, and group learning. Spence’s diagram is reproduced as Figure 
67. 

We may accept Spence’s analysis as historically accurate, without be- 
ing entirely bound by it. Some theorists, for example, instead of waiting for 
the better lawful relationships to be established in the fields of skill, 
problem-solving, and group learning, will wish to set up models to 
help in the design of experiments that will result in studies revealing 
lawful relationships in these areas. Spence’s diagram has the virtue that 
it does not prejudge the question as to which kind of learning is more 
fundamental , for the six kinds of learning are all at Level 1. The fact 
that the situations at the left are somewhat simpler , in terms of the 
control of experimental variables and the gathering of data, need not 
imply that any more fundamental psychological processes are involved. 

The search for greater inclusiveness is one direction for theory to take, 
and this search does not have to wait. 
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4. A more exact applied psychology of learning. This scarcely 
sounds like theory at all, yet a high order of theoretical work will be 
necessary to arrive at a set of practical rules that will make general 
psychological laws of learning applicable to teaching and training. It 
is one of the challenging tasks confronting the psychology of learning, 
and able talents are needed for it. 
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FIGURE 67. Levels of learning theory in relation to lawful relationships derived 
from experiments. Spence (1953). 


These several directions of advance suggest how much there is to 
be done, and how defeatist it would be to wait for the “brilliant theo- 
rist,” the Newton or the Einstein who will solve all of our problems for 
us. 


SPECIAL PROBLEMS THAT LEARNING THEORIES FACE 

Next in our retrospect we turn to some of the typical problems that 
were used to summarize the several systems, and to some related ques- 
tions that arise as we consider the problems the theorists tried to 
solve. 

Learning Capacities 

The extreme form of generalization from comparative studies is that 
there are no differences, except quantitative ones, between the learning 
of lower mammals and man. Hull stated this quite clearly: 

The natural-science theory of behavior being developed by the present au- 
thor and his associates assumes that all behavior of the individuals of a given 
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species and that of all species of mammals, including man, occurs according to 
the same set of primary laws. 3 

While such a position is more often implied than asserted, it is 
strange that the opposite point of view is not more often made explicit — 
that at the human level there have emerged capacities for retraining, 
reorganizing, and foreseeing experiences which are not approached by 
the lower animals, including the other primates. No one has seriously 
proposed that animals can develop a set of ideals that regulate conduct 
around long-range plans, or that they can invent a mathematics to help 
them keep track of their enterprises. Because a trained dog shows some 
manifestations of shame, or a chimpanzee some signs of co-operation, 
or a rat a dawning concept of triangularity, it does not follow that these 
lower organisms, clever as they are, have anything comparable to the 
richness of human mental activity. 

The problem of emerging types of learning ability must be pushed 
farther back. Language in man is perhaps the clearest of the emergents 
which carries with it a forward surge in what may be learned. It seems 
plausible enough that other advances in the ability to learn must have 
come about as the nervous system evolved through successive stages be- 
low man. The prevailing learning theories have had remarkably little to 
say about the different processes which together make up the functional 
changes which are classified together as learning. 4 There are probably a 
number of different kinds of learning which have emerged at different 
evolutionary periods, with the more highly evolved organisms using 
several of them. It is quite probable that these different kinds of learn- 
ing follow different laws, and it is foolhardy to allow our desire for 
parsimony to cause us to overlook persisting differences. 

Learning capacity can be assigned a quantitative meaning only when 
the several learning capacities are identified, and when we know more 
about the ways in which these capacities are modified. To what extent 
can the ceiling of ability itself be modified by training? It is evident that 
without appropriate prior experiences there are things to be learned that 
are quite within a mans capacity which he will never acquire. Without 
appropriate linguistic or mathematical tools, for example, there are 
kinds of problems he cannot solve, regardless of how much native in- 
telligence he has. After these tools are acquired, his learning ability rises, 
without any necessary inference that the ceiling of his ability was raised. 

The obstacle to a precise scientific statement of the problem lies in 

3 Hull (1945), page 56. 

4 Maier (1939a, 1939b) and Tryon (1940) have stressed similar points. 
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the difficulty of assigning such a limit until one is approached. The notion 
of a physiological or psychological limit as taught in our elementary 
textbooks is a convenient fiction. The records of athletic contests suffice 
to show how impractical it is to assign definite limits, because records 
are made by improving methods, and are not set solely by the speed 
of conduction of neural impulses. When the bamboo pole was intro- 
duced into pole-vaulting, for example, records increased by a full foot, 
but improvements in technique have led to their continued improvement 
beyond this. Kantor 5 has made much of the interbehavioral meaning of 
abilities. He would argue that typewriters must enter into the equations 
defining typing speed and poles into vaulting records. The difficulty 
exists, and must be taken account of. There is, however, an asymmetry 
between the learner and his instrument. The typewriter or the pole does 
not learn, and there ought to be some way of getting at what the limit 
would be for a given typist with a given typewriter or a given vaulter 
with a given pole. Until the ceiling of ability gets a satisfactory theo- 
retical-empirical definition, such problems as the effect of practice on 
individual differences can receive only superficial answers. 

Simple Conditioning 

The theory of simple conditioning is an inviting one. It says in 
effect that partial cues reinstate past stimulus-response situations, that 
what was done in the presence of the stimuli tends to recur in their 
presence. Despite all the attention commonly given to simple condition- 
ing in learning theory, the evidence for it is very fragmentary. There 
is no doubt that learning takes place under the arrangements of the 
conditioned-response experiment, but the results may usually be given 
alternative explanations. The fact is that there is little evidence that the 
simultaneous or nearly simultaneous occurrence of an incidental stimulus 
and an unconditioned response is the sufficient condition for establishing 
a sensori-motor association between them . 

The repeated failures to obtain pupillary conditioning through pair- 
ing of sounds with illumination changes is one of the most damaging bits 
of evidence against the conventional pattern assigned to simple condition- 
ing, for the unconditioned pupillary responses occur with regularity, but 
conditioned responses do not follow. 6 

5 E.g., Kantor (1947). 

6 To the earlier failures must be added those of Hilgard, Dutton, and Helmick 
(1949), and F. A, Young ( 1954). Successful conditioning of die pupil in human sub- 
jects has been reported by Mihama and Kotake ( 1954). Their results were so unstable 
that one would scarcely wish to base a theory of learning upon them. 
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If simple conditioning is not accepted as the rule, the alternative 
is to find the contingencies which give rise to results looking like simple 
conditioning. The usual solution is to find the stimulus-response se- 
quences related to an ongoing goal-directed activity, whether this be 
described as proximity to a reinforcing state of affairs, or as part of a 
field tension-system. It is undeniable that in many instances the learner 
will do what he last did in the presence of given stimuli because similar 
motives are operating, and what he did last time achieved the goal. 

Some types of direct action release — weeping at a letter bearing sad 
news, fainting at the sight of blood — appear to conform to a simple 
conditioning formula, but the full consideration of what is happening 
involves motives like anxiety, making these cases not unlike other illus- 
trations of learning with tension release. Action systems that seem to be 
directly released by perceived stimuli usually have either a low threshold 
or are supported by strong motives. Some people weep easily, so that 
tears to a letter would be a matter of low thresholds; others would weep 
only if the letter reported the loss of someone deeply loved. Some reac- 
tion systems, such as salivation and neurocirculatory changes, are in un- 
stable equilibrium most of the time, so that they are easily disturbed by 
a wide range of perceptual experiences. Because these actions can occur 
without environmental support, they may be activated independently 
of dominant ongoing activity at the time. 

While allowance must be made for some relatively automatic as- 
sociated movements released by perceived stimuli, in the more usual 
case these movements are integrated with goal-directed activity. This 
follows because the motives which support the goal-seeking activity serve 
also to lower the thresholds for the perceptually-aroused concomitants. 
Hence the hungry dog is likely to salivate while nosing up the lid to 
the food-box. Because the dog is hungry the threshold for salivation is 
low, and the sight of food or food-related objects is likely to lead to 
salivation. 

Hence, as alternative to the simple conditioning notion of associative 
learning we may consider several possibilities: (1) the aroused percep- 
tion may serve as an intermediary between the conditioned stimulus and 
the conditioned response; (2) sometimes the appearance of the condi- 
tioned response will be a matter of facilitation rather than of associa- 
tion; (3) goal-directed activity may provide an important background 
for what appears to be simple conditioning. 
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Stimulus , Response 9 and Mediating Processes 

Stimulus and response concepts have a long history in psychology. 
The original S-R psychology was not that of learning theory at all: it be- 
gan with reaction time, as psychologists measured the time between the 
signal and the response to it, and then tried in various ways to fractionate 
this time. Even at the start, it was clear that there were intermediate 
processes beyond merely the speed of impulse transmission. In his early 
criticism of the reflex-arc theory, Dewey (1896) had in mind these re- 
action-time experiments and the generalizations that were being made 
from them. Woodworth, a student of the neuro-physiologist Sherrington, 
urged strongly that the S-R motto be hung on the walls of the psycho- 
logical laboratory as a reminder to the student to think in stimulus- 
response terms. Woodworth later modified his S-R motto to read S-O-R, 
recognizing that the mediating processes in the organism always had to 
be taken into account in the correlation between stimulus and response. 

It is meaningless to approve or disapprove of stimulus-response in 
general, because all psychology can be translated into stimulus-response 
terms. Before expressing approval or disapproval it is important to 
specify what kind of stimulus-response theory is under consideration. 

Two kinds of stimulus-response theory are worth distinguishing. 
One of these we shall call a substantive S-R theory, the other a func- 
tional S-R theory. 

By a substantive S-R theory we shall mean the kind of theory that 
says that the only facts of interest to a psychologist are facts of physical 
stimulation and measurable responses (muscular movements and glandu- 
lar secretions). Not only is the theory tied to S’s and R’s, but all inferences 
are to S’s and R’s, including inferences to mediating processes . For this 
reason, such theories are also called peripherals Hulls fractional antici- 
patory goal response (r G ) is a good representative of a substantive 
intermediary. 

A functional S-R theory is anchored to stimuli and responses as points 
of reference, in giving operational definition to the measurements used in 
the theory. That is, the theory always points to S’s and R’s in describing 
its experimental results or in beginning its theory construction. Mediat- 
ing processes, however, need not be conceived in S-R terms, and they 
may have purely functional characteristics. The drive level (D) in the 
theories of Hull and Spence is a concept of this kind. It is given opera- 
tional definition in physical terms, according to maintenance schedule 
and hours of food deprivation, but D itself is characterized by a mere 
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number, a multiplying constant without any dimensions of its own, unless 
we infer it to have the dimensions of reaction potential. Thus a system 
such as Hulls or Spences may be substantive in some parts and func- 
tional in others. 

The functional S-R theory is generally congenial to the functionalist 
psychologists, whereas the substantive S-R theory is more common among 
neo-behaviorists. Thus the stimuli used in a memorization experiment may 
be classified by the functionalist according to degree of meaningfulness. 
Because degree of meaning can be specified denotatively by reference to 
the material used in the experiment, the difficulty of defining the 
material in strictly physical stimulus energy terms does not worry him, 
although it should worry the substantive S-R theorist. The functional 
S-R position permits further specification of the mediating processes, 
but these need not be specified in S-R terms. Thus the mediators can be 
brain states, if evidence can be found for them. 7 

MacCorquodale and Meehl (1953) have shown that an expectancy 
theory can be written in functional S-R terms, so that the main difference 
between their theory and one such as Huffs or Spences is the difference 
that I have indicated between substantive S-R theories and functional 
S-R theories. 

If this analysis is correct, the issues between the theories arise 
largely over the nature of and specification of the mediating processes. 
The factual results are going to be accepted in the end by any theory 
that survives, so that there is no point in looking for a crucial experiment 
at the level of learned behavior. It is only experiments that bear crucially 
on the mediating processes that will make possible a choice between 
competing theories of learning. Once inferred intermediaries appear to 
account satisfactorily for the relationships found in experiments, experi- 
ments designed to study these intermediaries as directly as possible will 
prove helpful. We do not have enough such experiments today to force 
the impartial judge to select one general theory over another. 

Perception and Learning 

The problem of perceptual discrimination is a central one for learning 
theory. There is some uncertainty as to the manner in which perception is 
natively organized, but at a primitive level there are undoubtedly some 

7 Thus either substantive S-R theorists or functional S-R theorists may use hypotheti- 
cal constructs in the terminology of MacCorquodale and Meehl (1948), but the sub- 
stantive S-R theorist puts them in S-R terms, if he is consistent. Osgood's ( 1953) media- 
tion hypothesis appears to me to fit the pattern of hypothetical constructs within a 
functional S-R theory. 
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discriminations of continuity and discontinuity, of change and succession, 
of boundary phenomena, of proximity and remoteness. There is an old 
controversy between those who believe that the organization of perception 
is essentially unlearned, and those who believe that practically everything 
within perception is learned. 8 It is not necessary to enter this controversy 
in order to point out that the learning theorist must be aware of the learn- 
ing within perception as well as of the role of perception in learning. 

The perception of objects suffices to show that there is at least some 
learning in perception. A cake of ice looks cold; the red tip of the poker 
looks hot. The properties of cold and hot, not present to the senses, are 
present in the perception as a result of prior experiences. They are not 
judgments or deliberate inferences from perceptual data, but are given 
as immediately as any other of the properties of the perceived objects. 

The process whereby identifying signs are used to represent a total 
situation previously experienced in called redintegration. 9 The ice looks 
cold because the previous experiences of felt cold are redintegrated by 
its appearance. The paramnesic experience (sometimes referred to as 
dejd vu) is probably of this sort. That the process occurs is evident, but its 
explanation is not easy. 

If we accept the fact of perceptual learning, and we believe in some 
sort of goal orientation in learning, can we discover any goals in per- 
ception that might control perceptual learning? I have elsewhere sug- 
gested two possible perceptual goals: (1) the achievement of environ - 
mental stability , and (2) the achievement of clarity and definiteness in 
perception . 10 

The organism seeks a perceptually stable environment in a fashion 
somewhat parallel to that in which it seeks an internally stable environ- 
ment. There is a kind of environmental homeostasis parallel to physio- 
logical homeostasis. The equilibrium is a dynamic one, and the external 
environment, like the internal one, is ever changing. The organism toler- 
ates perceptual differences between night and day as it does physio- 
logical differences between waking and sleeping, but it does not accept 
an environment that distorts too rapidly. If a man s perceived environ- 
ment distorts too rapidly he gets upset or seasick. 

In normal perception the goal of stability accounts for many of our 
perceptual achievements. For example, were it not for this achieved 

8 The contrasting positions are represented by Koffka (1930) and Carr (1935). 

9 Hollingworth (1928). 

10 Hilgard (1951). The next few paragraphs borrow heavily from this source. 
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stability, the visual world would move as we move our heads from side 
to side. That such stability is an achievement is easily demonstrated by 
seeing what happens when you view the world through reversing lenses. 
When the visual world is unfamiliar, as it is through reversing lenses, 
the line of regard is the anchoring point, and when you move your head 
the world races by in the direction opposite to the movement of your line 
of regard. Stratton ( 1896 ) ( 1897 ) found that after wearing the lenses for a 
while the visual scene no longer would swing with head movements: sta- 
bility had again been achieved. 11 

The goals of stability and of definiteness are similar, but they can 
be distinguished. In reversible geometrical figures, for example, defi- 
niteness is achieved at the price of stability. Stability might be better 
achieved if a geometrical figure were perceived as only so many lines 
on the paper. Instead we see it as a something , even though it is am- 
biguous, and so the "somethings” it represents tend to alternate. 

Woodworth felt convinced that there is a fundamental motive to 
perceive clearly. As he put it: "To see, to hear — to see clearly, to hear 
distinctly — to make out what it is one is seeing or hearing — moment by 
moment, such concrete, immediate motives dominate the life of relation 
with the environment.” 12 

Recent emphasis upon "curiosity” as a motive for learning suggests 
that such "curiosity” may also be a motive within the learning of per- 
ceptual responses themselves. 13 

There is a strong tendency to construct concrete things out of the 
patterns we perceive, for concrete things have definiteness. Hebb dis- 
tinguishes three kinds of perceptual unity: a primitive, sensorily deter- 
mined unity, as that of a splash of black ink on a white card; a unity 
affected by experience, as in the perception of familiar geometrical 
figures such as squares and circles; and a unity of identified figures or 
objects. 14 Once a perceptual pattern is identified as a familiar object, 
the perception of the object remains remarkably constant, and the so- 
called object-constancies apply. 15 

These conjectures suggest the possibility of further experiments on 

11 Some fascinating new experiments wjth inverted vision have recently come from 
Innsbruck, substantiating and extending Strattons findings, e.g., Kohler (1953). For 
a discussion of them, see Werner and Wapner (1955). 

12 Woodworth (1947), page 123. 

15 The experiments of Butler (1953) ( 1954) on curiosity and Montgomery's experi- 
ments on exploratory drive, Montgomery ( 1952) and later, have already been cited. 

i* Hebb (1949), 19-35. 

16 E.g., Brunswik (1944) (1955). 
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/learned features of perception, in line with other studies of motivated 
/rning. 16 

The relationship between perception and learning is a reciprocal 
one. So far we have considered the role of learning in perception. Now 
we may consider the role of perception in learning. The cognitive 
theories all assign an important place to perception and perception-like 
processes in learning; some even have gone to the extreme of identify- 
ing learning with perceptual restructuring. 17 Currently S-R psychologists 
are paying more attention to perception in learning. Spence (1955) 
includes “looking at” the food in the goal-box as a feature of reinforce- 
ment. Lawrence (1949) (1950) emphasizes acquired distinctiveness of 
cues as a part of discrimination learning. Wickens (1954) decides that 
the stimuli in learning experiments arouse perceptual responses that are 
the intermediaries to which overt movement responses are made. 

A complete learning theory must incorporate perceptual processes 
as part of the main body of theory. Secondary reinforcement, increasingly 
prominent in S-R theories, is mediated largely by something akin to per- 
ception. These relationships cannot be permitted to remain unexamined 
by the theorist. 

Motivation 

In Chapter 12 we had occasion to review some of the studies con- 
cerned with the details of drive and reinforcement, and with the current 
criticisms of the need-drive-incentive conception of motivation. An im- 
portant contemporary trend is to supplement the tension-reduction theory 
( escape from food tensions, escape from pain or fear or anxiety) with more 
positive incentives, such as exploration and curiosity, or the arousal of 
positive (pleasurable) affect. 

That there are goal tensions, and that the goal situation is an end- 
state that makes a change (either toward relaxation or elation) cannot 
be denied. In that sense, everyone must accept a reinforcement princi- 
ple, meaning nothing more by it than that rewards and punishments mat- 
ter for learning. This has come to be called the empirical law of effect . 
Its incorporation into quantitative theories may remain at this level, 
but its further theoretical explanation raises a number of additional 
questions. 

16 The many recent studies of perceptual distortion by way of needs and persistent 
personality characteristics are also relevant even though somewhat controversial. See 
summaries by Bruner (1951), Klein (1954), Postman (1953). Against a learning 
theory of perceptual change, see Gibson and Gibson (1955). 

17 Krech and Crutchfield (1948). 
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The necessary distinction between performance and learning, be- 
tween facilitation and association , as Seward (1952a) puts it, does not 
require that motivation be assigned to one of the terms and not to the 
other. Everybody agrees that motivation affects performance, but there 
are arguments over the extent to which it affects the learning of habits 
or of cognitive structures. The problem is an empirical one, but the 
goals of the learner almost surely affect both learning and performance. 
Motivation determines attention, and attention is related to cognitive 
restructuring, the form of learning that is supposed to be most detached 
from performance. The effect of motivation upon learning and perform- 
ance is not symmetrical. A starved person may be motivated to eat but not 
to learn new table manners. Table manners are probably better learned 
when the hunger drive is less strongly aroused. 

The introduction of personality variables into conceptions of drive 
(e.g., anxiety or need for achievement as measured by tests > suggests 
an important aspect of motivation long neglected in learning theories: 
the relation of the given circumstances to the more persistent goals of the 
individual. If there is any teaching that has come to be generally accepted 
as a result of Freudian psychology (and related conceptions of person- 
ality), it is that motives are organized in some sort of hierarchy within 
the individual, resulting in a value-system expressed in behavior. This 
system may go by such names as “ego” or "character-structure,” but 
whatever it is called, it becomes very important for the learning of that 
individual. A sophisticated learning theory, however it conceives its 
primary data or principles, must take into account the organization of 
motives in the individual. Hence concepts such as ego involvement and 
level of aspiration, as introduced by Lewin and his followers, become 
important supplements to theories of reward and punishment in the inter- 
pretation of motivated learning. 

Reinforcement , Provisional Try, and Feedback 

Many learning problems require the selection of one or another 
possible mode of action in order to reach a goal. Because alternative 
responses appear one after anoth^fc^ntil the correct one is stumbled 
upon, this learning is commonly delualbed as trial and error. This designa- 
tion is descriptively appropriate to\ several standard laboratory experi- 
ments. In motor-skill experiments, acceptable responses have to be dis- 
covered within a range of movement possibilities; in the problem-box, 
there is usually one correct response among many unspecified possi- 
bilities; in discrimination experiments, selection is to be made from among 
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fixed alternatives; in the maze and multiple-discrimination experiments, 
successive choices must be integrated in proper order. 

Two main theoretical problems arise: the problem of the nature 
of the original adjustment ending in the correct act, and the problem 
as to how this adjustment is facilitated when the situation is repeated. 

Thorndikes explanation has tended to separate the two problems. 
The initial adjustment is a result of the tendency of the organism to vary 
its responses flaw of multiple response”). The acts within its repertory 
run off more or less by chance until the correct one is hit upon. Then 
the second principle comes into play* The correct response is stamped in 
("law of effect”). When the situation recurs, the rewarded response has 
a favored position in relation to the rest of the possible multiple responses, 
and tends to occur earlier than before. Because prior experiences will 
have established preferential orders of response even in a situation rela- 
tively novel, the multiple responses will never, in fact, be purely random. 
Hull has worked out a theory like Thorndikes, giving an orderly account 
of these possibilities. 

The alternative is that the original behavior is not the running off of 
earlier habits in the new situation, but is a genuine attempt at discovering 
the route to the goal. Past experience is used, but in a manner appro- 
priate to the present. Such an interpretation makes the original adjust- 
ment a provisional try ; to be confirmed or denied by its success or failure. 
What is here being called a provisional try corresponds to what Tolman 
and Krech have called ‘hypothesis” behavior. The theory supposes that 
a provisional behavior route is kept in suspension until its consequences 
change its provisional status; if it is confirmed it is an appropriate path 
of action to be followed under like circumstances. 

When the provisional try was first suggested as appropriate for 
much behavior classified as trial and error, 18 the concept of feedback 
mechanisms had not yet gained wide currency. All that the provisional 
try implies is a feedback that corrects the provisional responses according 
to their consequences. The conception of provisional try, and of feedback, 
suggests that the role of reinforcement is essentially informative , more 
or less what Thorndike implied spj^pdie talked of the confirming or 
"OX” response. 

There are some consequences for/quantitative theories if the informa- 
tive role of reinforcement is accepted. The relationship between the 
informative role of a stimulus and its facilitating role is not one-to-one. 
The word "yes” is discriminated from "no” when the words are spoken 
18 In the first edition of this book, Hilgard ( 1948). 
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softly as well as when they are shouted (though there are some differ- 
ences, as every army recruit and army sergeant knows!). Hull theorized 
that habit strength requires some (unspecified) minimum of drive- 
reducing reinforcement, but is not dependent on the amount of drive 
reduction beyond the necessary minimum. Hulls position can be as- 
similated to an information theory. The minimum reinforcement neces- 
sary for habit strength is that required for information, for feedback; 
excesses beyond that affect reaction potential but not habit. 

The conception of the provisional try is especially appropriate when 
the movement called for is not clearly specified by the apparatus. In 
mirror drawing or pursuit learning or ball tossing the performance called 
for is a pattern of movements correct in direction and (in some cases) in 
rhythm. The general character of the correct movement is understood at 
the outset, but it cannot be produced without considerable fumbling. 
This fumbling is a real try. The learner does not act in chance fashion, 
no matter how clumsy his movements may be. In this kind of situation 
either Dodge’s designation approximation and correction or Woodworths 
trial-and-check is appropriate. 19 Theories of spontaneous movement and 
of conditioning of previous responses have never been successful in deal- 
ing with the detailed movements that occur in manual skill. The sug- 
gestion that any new movements must be compromises between move- 
ments elicited as old responses to familiar stimuli does not take the 
problem seriously. The movements are new integrations at first crudely 
adapted to the situation at hand, presently becoming better adapted. 

It may be that an organism at some early stage in its development 
has to learn how to try, 20 but by the time it is ready for the usual labora- 
tory experiments it has learned, and its behavior is probably not greatly 
unlike the human learner acquiring a skill. The cats behavior in the 
problem-box is best described by saying that the cat is trying to get 
out of the box. When the cat quits trying, and, as Lewin would say, 
“goes out of the field,” its not-trying is evident also. It may lie down, 
lick itself, appear to be relaxed, then suddenly return to a state of tension 
and again begin to try. 

The many experiments currently under way on visual tracking 21 
(that is, manually following a visual target that moves in either a regular 

10 Dodge (1931), page 113, Woodworth (1947). 

20 Hull (1937) shows the possibility of deriving “striving” from more elementary 
principles. 

21 The experiments have been encouraged by the need for this performance in mili- 

S activities, e.g., in tracking a moving target with a gunsight. See Chapanis, Gamer,. 
Morgan (1949), pages 264-295. 
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or an irregular path) require for their explanation: (1) the goal of re- 
maining on the target, and (2) the perception of discrepancies from 
the target, resulting in corrective movements. This is provisional-try 
behavior, corrected by feedback from the consequence of the attempted 
positional movements. 

The "tracking” behavior of a hunting dog gives a naturalistic illustra- 
tion of similar behavior. Woodworth, for example, long ago used the 
behavior of the dog as an illustration of "seeking” behavior: 

That there is a persistent inner tendency towards the consummatory reac- 
tion is seen when, for instance, a hunting dog loses the trail; if he were simply 
carried along from one detail of the hunting process to another by a succession 
of stimuli calling out simple reflexes, he would cease hunting as soon as the trail 
ceased, or follow it back again; whereas what he does is to explore about, seeking 
the trail, as we say. This seeking, not being evoked by any external stimulus (but 
rather by the absence of an external stimulus ) , must be driven by some internal 
force; and the circumstances make it clear that the inner drive is directed toward 
the capture of the prey. 22 

This behavior, controlled by some sort of "fit” between the perceived 
environment and the goal-seeking behavior of the organism, involves 
the assimilation of feedback information along the route as well as at 
the end. It descriptively justifies a concept such as that of the provisional 
try. The incorporation of this behavior into a more general theory re- 
quires further steps that have not as yet been taken. 

Overlearning and Stereotypy 

A theory which assumes that behavior is regulated reasonably 
on the basis of available information is put to it to account for the per- 
sistence of habits in situations in which they are no longer adaptive. 

One kind of stereotypy, normally adaptive, is a consequence of 
overlearning. By overlearning is meant the facilitation of response with 
repetition after the essential learning has been mastered. Memorization 
furnishes convenient illustrations. A series of digits can be learned in a 
single presentation so that it can be correctly repeated if recall is im- 
mediate, But if the series is long enough to be near the limit of immediate 
memory, more repetitions are needed to fix the list so that it can be re- 
called after greater lapse of time. The most evident effect of overlearn- 
ing is this one upon recall. It provides the explanation for the long 
retention of overlearned skills such as swimming or bicycle riding. 

But overlearning has other consequences than guaranteeing re- 
tention. These have been little studied, and only a few suggestions are 
22 Woodworth (1918), page 41. 
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available. One possible result is a mechanization of habit, leading to a 
loss of docility.” Oft-repeated activities take on some measure of goal- 
character, perhaps through a process like secondary reinforcement. The 
learner is then “blinded” to new possibilities. 23 Rats which have learned 
a path including a jump appear to prefer that path to other more eco- 
nomical ones after they have been overtrained on it. 24 Maier and his as- 
sociates interpret some of their data as showing that habits learned in 
accordance with original preferences are more easily altered than habits 
built up against original preferences. 25 If this be generally true, it may 
account in part for the persistent preferences by adults for highly arti- 
ficial ways of doing things learned as children. 26 

The familiar way of doing things is commonly economical, and there 
is no special mystery about its persistence. It has had motivational sup- 
port according to any theory. There are other instances of stereotyped 
fixations, however, which have had no such history of support as adap- 
tive habits. Their occurrence raises additional problems. 

One source of fixated behavior is the insoluble problem. There is no 
way in which to make a reasonable adjustment, because the order of events 
in the environment is chaotic. There is no basis for choice, and a ritual- 
istic fixed choice is adopted. Such a ritualistic performance represents a 
giving up of the attempt to predict the occurrence of reward on the basis 
of “correct” behavior. The environment is unresponsive to the trying by 
the learner. It is not surprising that such a pattern of behavior persists 
against punishment. Maier s finding that there was evidence that positive 
and negative cards were in fact discriminated, even though the position 
habit remained fixed, means in cognitive terms only that the organism 
finds the situation too difficult. The problems which experimenters set 
animals are often fairly complex when translated into our own experi- 
ences. In this situation the animal learner, in adopting the fixated habit, 
has rejected the second window as no better than the first, since its 
provisional tries were frustrated as frequently at one place as the other. 
Because there now develops a pattern of predictable success and failure 
at the first window does not mean that things would be any better at 
the other one. A human parallel might be as follows. Suppose that in 
long experience the front door and the rear door of the laboratory had 
always been unlocked together. Even though one knows about both 

23 Luchins (1942), Wertheimer (1945). 

2 * Gilhousen ( 1931 ) . 

25 Maier, Glaser, and Klee (1940). ^ 

26 Holt ( 1931 ) attributes this fixity of familiar ways of doing things to "canalization” 
in childhood. His position has been adopted and extended by Murphy (1947). 
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doors, he might sit a long time before the front door, waiting for it to 
be opened, before trying the back door, even though on that day a care- 
less attendant had unlocked one door and neglected the other. Fixated 
behavior need not be interpreted as a compulsion to which the learner is 
a slave. An alternative is that the situation is misinterpreted in the light 
of the learners experience, and the sense of alternative has been lost. 
The ease with which the alternative can be brought back into the animal 
learners behavior field was shown by Maier when, through manual 
guidance, he quickly restored the animal's discriminatory responses. 

Another source of fixated behavior is excessive punishment at the 
moment of choice. The general frustration probably interferes in this 
case with orderly cognitive processes, so that the learner falls back on 
stereotyped responses. The reduction in cognitive clearness under intense 
emotion is what is proposed in the expression “blind with rage” 27 

Still another form of stereotypy is that in the Guthrie and Horton 
experiment, for it is not based on overlearning or insolvability or frustra- 
tion. The two features of the Guthrie and Horton situation which may 
have given rise to stereotyped behavior are that the task is ( 1 ) easy, in 
the sense of being promptly solved, and (2) baflling, in the sense that 
the response being made is unclearly related to its consequences. The 
most economical behavior in a situation that is confusing but successful 
is to do what worked the last time. If a door that is stuck opens promptly 
when you happen to raise up the knob, the tendency will be great to 
repeat that behavior the next time without inquiring into the physics of 
the situation. Maybe it would also open if you pushed down, but there is 
no need to try the alternative if the prior solution came quickly. The 
cats that responded directly to the pole, while facing it and the door, 
transferred more readily to new pole positions than the cats that backed 
into the pole. The direct tries of practically all of the cats were oriented 
toward the escape door. The pole was struck in a posture that was door- 
related, not pole-related, especially by those cats that were backing up. 
If the relations were unclear, but the solutions prompt, the stereotyped 
performance would follow, according to the conjectures we have ad- 
vanced here. 

Overleamed, fixated, and stereotyped behavior can be made co- 
herent with the theory of the provisional try if two assumptions are ac- 
cepted. One assumption is that after sufficient overlearning the learner 

27 The complex role of emotion in learned activity deserves more detailed study than 
it has been given. See, for example, Brady and Hunt (1955). 
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no longer tries, unless something dramatic again arouses his searching 
behavior. Otherwise the old ways of doing things are preferred. The 
other assumption is that stereotypy results when situations are cognitively 
too difficult. Here again the organism stops trying, and adopts a patterned 
mode of response without regard to the environmental consequences. 28 

Understanding , Problem-Solving, and Inventive Ideation 

Because learning is to some extent under cognitive control we must 
not suppose that it always proceeds with full understanding. A correct 
response may be made time and again in a situation without the learners 
awareness of what is being done. The probabilities of a situation may 
be estimated without the learner's making any computations. The experi- 
ments of Thorndike and his collaborators on learning without awareness 
are of this sort. 29 The subjects made discriminations on the basis of the 
probabilities of the situation, but could not tell what they had done. Yet 
their responses were predictable in terms of the arrangements of the 
learning situation. The experiments were used to substantiate a blind 
law-of-effect interpretation, but they may be explained equally well on 
the basis of cumulative discriminations among real tries. 

While there is a tendency to achieve clarity and to learn with un- 
derstanding when that is possible, there is a limit to the learner s curiosity, 
and it is not true as implied in some gestalt writings that learning takes 
place with all the understanding which the situation and the capacities 
of the learner permit. The understanding which the learner wishes in a 
problematic situation is knowledge of the essentials to economical goal- 
achievement, and nothing more can be counted on. The mistake is some- 
times made in teaching school children ( or college students ) of assuming 
that they wish to understand what lies behind a process which is for them 
just a tool. They wish to know how to use the tool to reach immediate 
goals; the further curiosity is related to different goals, which may be 
goals for the teacher but not for them. 

Because all learning is to some extent cognitively controlled, the 
distinction between blind learning and learning with understanding be- 
comes one of degree. There is a point, however, at which understanding 
takes on new prominence. That is in the kind of creative attack on a 
problem in which experience does not provide solutions ready-made, but 

28 Note in this connection the falling back on position habits when too many dis- 
criminatory cues are involved, Lawrence and Mason (1955). 

29 E.g., Thorndike and Hock 1934). For a different sort of criticism, see Irwin, 
Kauffman, Prior, and Weaver ( 1934 ) . 
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the solutions have to be discovered or invented through a reconstruction 
of experience appropriate to the problem at hand. This is not completely 
specified by experiments on insight, but it is suggested by them. 

There are several alternatives. One is to treat all problem-solving 
and reasoning as a form of trial-and-error learning, with some transfer 
to account for short-cut solutions. This is the position of Carr (1925), 
endorsed by McGeoch (1942). This position fails to grant any reality 
to the reconstruction of experience present in the act of reasoning which 
makes the “try” something other than the running off of varying habits. 
A second alternative, that of Hull (1935c), is to recognize that the mecha- 
nism for assembling prior experiences appropriate to problem solution 
has in it characteristics different from ordinary trial-and-error learning. 
Hull attempts to derive such behavior from other processes. This deriva- 
tion depends upon discrimination among the fractional anticipatory re- 
sponses that are common to several experiences and have to be inte- 
grated in order to solve the problem. 

Stimulus-response psychologists, under the restraint of Lloyd 
Morgan's principle of parsimony, 80 have assigned as little as possible to 
ideation, but they have usually recognized that some behavior is con- 
trolled by ideas. Thorndike called ideas by that name and granted their 
occurrence; Hunter recognized a “symbolic process” necessary to account 
for behavior in delayed-action and double-alternation experiments; 31 Hull 
inferred “pure-stimulus acts,” that is, responses whose sole function is 
to furnish stimuli to guide or integrate behavior sequences; Guthrie's 
position is about like Hull's, with great emphasis upon movements (es- 
pecially implicit speech) as the substance of thought. Tolman, in be- 
haviorist spirit, makes of ideation a “mere behavior-feint' at running- 
back-and-forth,” without specifying the precise nature of the feint. 

Is there any gain, having accepted ideation, in seeking a movement 
basis for ideas? To assert that the organism can discriminate between 
the sensory consequences of two kinds of anticipatory salivation more 
readily than between two perceptual redintegrations is no great gain, 
except for the logical possibility of discovering by separate techniques 
the presence of two kinds of salivation. Because both perceptual redinte- 
grations and stimuli from anticipatory responses are, in fact, inferred, it 
is as possible to be logically strict about one as the other — and the evi- 
dence used in the inference for one has to be as good as the evidence for 
the other . The benefit of the doubt need not go to the one preferring 

30 Morgan (1894). 

31 Hunter (1924). 
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movements. There is some concealing of ignorance in attributing specific 
stimulus-response bases for psychological functions such as drives, sets, im- 
ages, and thoughts. If we are critical about accepting the results of experi- 
ments, then stomach contractions are not the basis of appetite, eye move- 
ments are not the cause of the Miiller-Lyer illusion, kinesthetic cues are 
not the preferred ones in maze learning, tongue movements are not the 
basis for thinking. The burden of proof is on those who believe otherwise. 
That there are some movement concomitants of thought is insufficient 
evidence that these movements (and their consequences) are the suffi- 
cient condition for the occurrence of the thought. The argument that 
inferring movements will make easier the task of the neurophysiologist 
does not hold. It is no harder to make a brain model for perceptual red- 
integration than for the pattern of discrimination among the proprio- 
ceptive consequences of anticipatory responses. 

The alternative to trial-and-error and reinforcement theories is to 
make allowance for a genuine reconstruction of experience appropriate 
to problem solution, as proposed by Tolman (1932b) in his principle of 
creative instability and by Maier (1931) in his separation of reasoning 
from other kinds of learning. An organism capable of functioning at 
different levels does not keep his abilities in abeyance to be brought out 
to suit the convenience of an experimenter. Therefore if a given ability 
is shown in clear outline in one experimental situation, it may be sup- 
posed that if looked for it might also be found in other situations in 
which it is not as clearly exhibited. 32 Thus the reconstruction of experi- 
ence goes on to some extent in every act of redintegration. When ice is 
seen as cold, or a distant object as a motorcycle, the perceptions are not 
photographic, nor are they simply revivals of past experiences. They are 
perceptual achievements molded out of past experience and present 
sensory discriminations. It is only the more striking recombinations of 
previous experiences and present perceptions that we call reasoning. 
These more striking instances reveal aspects of the process which might 
otherwise be overlooked. 

One aspect of the detour or roundabout experiment emphasized by 
Kohler is the turning away from the ends of action to the means. The 
direct path to the goal may be blocked. A stupid animal will continue 
to make a direct attack, but a more intelligent one is able to turn its 
back and to take the long way around. Discovering a long route which 

82 Note this inversion of Lloyd Morgan's canon. As now inverted it might be para- 
phrased: “An organism capable of ideational problem-solving may also use ideas in 
learning situations in which they would be theoretically unnecessary.” 
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already exists is one way of solving such a problem. It is trial-and-error 
behavior, but in a direction against the attraction of the goal. Another 
solution is that of discovering or creating some sort of tool which then 
becomes a means to the goal. This often requires a reconstruction of 
experience, perhaps by putting some familiar object to unfamiliar use: 
making a pliers into a pendulum-bob, a clamp into a hat-hanger, a 
Bunsen-bumer tube into a blow-pipe, and so on. It is this reconstruction 
aspect rather than the detour aspect which is most significant in reason- 
ing experiments. 

The reconstruction of experience follows certain lines of solution 
which Maier calls “direction.” If there is a flame to be blown out, one 
“direction” is to find some way to direct a jet of air upon it, another 
direction is to use water, still another is to smother it with something 
solid. Each of these suggestions directs the search for appropriate objects 
to convert into means. In one of the psychological laboratories during 
World War II a glass plate was being used in a piece of apparatus in 
such a way that it was to be looked through and also used as a mirror, 
somewhat as in the Dodge tachistoscope. There were troublesome double 
images of a small fixation object due to reflections from both the front 
and the back surfaces of the glass. The first hypothesis was that the glass 
could be made so thin that the images would fuse. Successively thinner 
sheets of glass were used in following out this “direction,” but without 
solving the problem. The idea emerged of making the glass much thicker, 
so that the images could be drawn apart and one of them screened off. 
This worked. The direction “if we can make it thin enough” was one 
hypothesis used until the other hypothesis “if we can make it thick 
enough” supervened. 33 This sudden restructuring gives the subjective ex- 
perience of insight. 

Understanding and inference may be used in unsuccessful solutions 
as well as in successful ones. The flash of insight is not evidence of un- 
usual heights to which intelligence has soared, but of an aptness of the 
solution suggested. A solution may be achieved with understanding with- 
out any such experience of insight. For example, in fitting mathematical 
functions to a given set of points it is often necessary to try out curves 
from several families to see which will achieve the best fit. Such a solu- 
tion to a problem goes on at a high level of skill and of understanding, 
but when the best fit is found there is no necessary experience of insight. 
Insight ^experienced only when there is an integration of experiences, 
33 1 am indebted to S. Smith Stevens for this illustration. 
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a restructuring, a seeing of things in a new light, appropriate to the prob- 
lem at hand. 

Because the integration and reconstruction of experience go on in 
many acts of learning, reasoning need not be made a separate category 
as done by Maier. This does not mean that there may not be one or more 
special capacities involved in reasoning ability and in other sorts of 
creativity. But there may be different learning processes below the rea- 
soning level as well. In order to avoid artificial boundaries, it is prefer- 
able to include together in one family of processes all forms of problem- 
solving and reasoning, subject to later determination of the kinds and 
number of processes involved. 

To What Kind of Theory Would the Foregoing 
Comments Lead? 

The comments on the several kinds of problems facing learning 
theorists do not in themselves constitute a theory of learning, yet they 
express certain desiderata for a theory. If they were to be co-ordinated 
with any one of the preceding viewpoints, they would fit best with the 
tolerant functionalism recommended by Woodworth. 

The merit of the broad functionalist position is the willingness to 
accept new dimensions as they are discovered, to tolerate some theoreti- 
cal inconsistencies rather than fly in the face of well-demonstrated experi- 
mental relationships. Such a positivistic attitude has worked well in other 
sciences. Even the elegant science of physics has stumbled along for a 
good while with inconsistent theories such as the wave theory and the 
quantum theory of light. Biologists do not worry too much that their gene 
theories and their theories of embryological development are not yet satis- 
factorily integrated. 

The difficulty with functionalism is that it does not satisfy the esthetic 
desire for a neat system. The motivation to seek for a few postulates, to 
find interchangeable constants, to be able to move logically from one 
part of the system to another has worked well in the physical sciences, 
and there is every reason to expect it to work well in psychology also. 
Functionalism has no quarrel with this possibility, and it is entirely pos- 
sible to seek systematic elegance within the tolerance of a functionalist 
position. In the end, we shall know more and find more convenient ways 
both of expressing what we know and of finding out still more. That is 
ultimately what we are after, however that end is achieved. 
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AN EXPERIMENTAL PROGRAM 

In the foregoing suggestions toward a theory of learning the errone- 
ous impression may easily be given that the concepts represent a return 
to the pre-experimental period of psychologizing about learning. Accept- 
ance of such a point of view might be interpreted to mean turning our 
backs upon the advances which have come through rejecting mentalistic 
and mystical concepts in favor of objective behavioral ones. In order to 
allay such fears and correct misapprehensions, I wish to propose what 
appear to me to be some directions in which contemporary research upon 
learning might move. 

Naturalistic Observations 

Because the search must continue for the most appropriate concepts 
to be used in die study and description of learning, it is desirable to keep 
the approaches flexible and to welcome novelty as well as precision. In 
urging consideration for a concept like the provisional try, I have not 
intended to discard the results of experimentation which have been ob- 
tained under other concepts, but only to break through the rigidity which 
makes us think of nonsense syllable lists, mazes, conditioning experiments, 
and target practice as exhausting the phenomena of learning. 

The common acknowledgment of the ingenuity and originality of 
the experiments by Lewin and his students (even by those most devas- 
tatingly critical of his conceptual system) leads to a search for the roots 
of this freshness of approach. The answer probably lies in Lewin s tend- 
ency to pick problems out of real life situations, instead of choosing 
examples which would fit a previously learned method of study. It is out 
of real life one finds problems such as the forgetting of an intention, the 
tendency to resume interrupted tasks, and so on. Such problems are not 
thought up by students whose only concern is to counterbalance practice 
effects in a standard experiment in such a manner as to add an additional 
decimal point to the determination. 

Somewhat the same thing might be said about Freud’s contribu- 
tions. Because Freud was listening to real people in their efforts to recall 
and to come to grips with their recollections, he discovered facts about 
repression and distortion in memory which were unknown to the fol- 
lowers of Ebbinghaus. 

It would probably be worth while for someone to do for learning 
what Brunswik has been doing for perception and Barker for child de- 
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velopment 34 — that is, take a notebook and follow a child around for 
hours at a time. What are the circumstances under which he learns? 
What performances provide for recall? for recognition? for problem- 
solving? What tasks are left half-finished? What kinds of forgetting are 
there? What motives can be inferred? Such observations would be like 
a breath of fresh air in most learning laboratories. To be sure, many 
observers would see only what they were taught to see. Some would find 
nothing but trial and error, others would see the child doing only what 
he last did, others would be impressed by the child’s insight. But of a 
group who started out in this manner one or two would probably come 
back with suggestions as fertile as those of Lewin and his students. Such 
suggestions would help break the sterility of contemporary learning 
theories, and would help in the search for appropriate new concepts. 

Physiological Changes Concomitant with Learning 

No one should infer because this book has largely neglected physi- 
ology 35 that it is anti-physiological. On the contrary, it is a blot upon our 
scientific ingenuity that after so many years of search we know as little 
as we do about the physiological accompaniments of learning. 

Learning theories may be developed at a molar level, independent 
of neurophysiology. It is preferable that the division of labor between 
psychology and physiology be recognized, and that psychological theories 
be developed objectively and self-consistently at the molar level, with- 
out recourse to a hypothetical neurology or a physiology not grounded 
in demonstrated relationships. Once, however, the possibility of such a 
learning theory is granted — and there appears to be increasing agreement 
on this point — then an appropriate position with respect to neurophysi- 
ology can be established. The learning theorist does not have to wait 
for physiology in order to go ahead on firm scientific ground, but this 
does not mean that he will not welcome any advance in knowledge in 
this field, nor that he will not wish to contribute to such knowledge 
through appropriate experiments. 

One of the most crying needs is for a crucial experiment identifying 
specifically a change occurring in neural tissues (or in bio-electrical fields 
related to such tissue) as learning takes place. Possibly the first task is 
to find an organism as appropriate for this purpose as the fruit-fly has 
been for genetics. The white rat is probably too complicated. Perhaps a 

34 Brunswik ( 1944). Barker and Wright ( 1954). 

35 Except for the brief discussion in Chapter 13. 
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simpler organism like amblystoma should be chosen, in which the nervous 
elements are few 'enough to be under direct inspection with modern 
techniques. It is possible to teach larval amblystoma simple habits, 36 
and it ought to be possible to find out what sort of “trace” these habits 
leave. To refuse to search for such a change would be like refusing to 
look for the micro-organism causing malaria just because an objective 
plot of the temperature fluctuations in the patient could be made a means 
"of identifying the disease. It can be stated with reasonable confidence 
that there are changes in the nervous system accompanying learning, 
and, if there are, then there are surely no insurmountable obstacles to 
detecting such changes. It is a surprising thing that, with all the interest 
in learning, this direct identifying step has not been taken more success- 
fully. 

The other possibilities, besides the study of the simpler organism, 
include the study of special preparations, such as the spinal animal. Elec- 
trical methods of exploring the cortex may succeed, but they, like be- 
havioral studies, are likely to be at a molar level from which the 
identifying steps still have to be made inferentially. 

Much that is known and can be found out about learning will not be 
altered by changes in neurophysiological knowledge. This is the basis of 
the argument for learning theories which dispense with physiology, and 
it is sound. But it would be foolhardy to suppose that a knowledge of 
the physical substratum of learning would not be helpful in further ad- 
vancing our theories and in achieving control over the learning process. 
Such knowledge is indeed so important that psychologists interested in 
learning will wish to continue in the future, as they have in the past, to 
share with physiologists and neurologists in experimentation designed to 
get at the changes in the organism which go along with manifest behavior. 

Standardization of Methods and Measures 

Scientific facts ought to be gathered in such a way that later workers 
can move ahead on the secure foundation of fact accumulated by earlier 
workers. In more advanced sciences, such as chemistry and physics, large 
handbooks are devoted to tables and formulas conserving this accepted 
body of knowledge. 

The handbooks of psychology are not of this kind. While they serve 
as a storehouse of important experimentation and theory, they contain 
relatively few tables of measurements or constants that can be used as 
starting points for later work. Suppose, for example, a student comes to 
36 Moore and Welch ( 1940). 
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one of us and asks: “Will you please measure my reaction tim e) and tell 
me how I compare with others?” Of course we know how to connect a 
telegraph key in circuit with a sounder and an electric timer. But do we 
know how to secure results that can be compared with norms? How many 
practice trials, or perhaps sessions of so many trials each, should there be 
before we begin to count? How many measurements do we need for a 
stable value? A ready signal? The interval between ready signal and 
stimulus? If an auditory stimulus, how loud? A useful start was made by 
those who assembled what relevant material they could find in a Hand- 
book of human engineering data* 7 but the very lim itations in what they 
could find helped to show how inadequate most of our standardization 
is. The two studies of simple auditory reaction time that they report 
yielded means of 192 msec, and 153 msec., the second mean being a 
full standard deviation below the first. Of course the conditions differed, 
but that is why we would have difficulty in measuring our student’s re- 
action time and knowing whether it was long or short, unless we troubled 
to run our own normative group. 

If we are to make advances in quantitative theories of learning we 
need data that are comparable, and this means more standardization than 
has yet been achieved. 

Two arguments are used against standardization: 

1. Standardization may prove wasteful until our conceptual systems 
have settled to the point where we know what we want to measure. If 
we proceed to standardize now we may spend a great deal of effort on 
the trivial, or on some device or method tied to outworn concepts. For 
example, we might finally achieve a calibrated set of nonsense syllables, 
just to find that those working on rote memorization prefer to use mean- 
ingful words in their experiments. 

2. Standardization, because it demands conformity, might put a 
damper on originality, and hence achieve comparability at too great a 
cost. The “trained observer” of early introspective psychology ran this 
danger, and some suspected that students observed the same things as 
their masters simply as a result of this “standardization” of what was 
permissible in a report. 

Both of these arguments are somewhat misleading. Consider first 
whether or not standardization is premature. Nobody would attempt 
large-scale standardization of equipment and methods except in fields 
in which there is widespread current activity. If the fields later became 
inactive, that would not prove that standardization was a mistake. It 

37 Tufts College Institute for Applied Experimental Psychology (1952). 
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conceivably might have helped to settle some issues that had previously 
remained open. Furthermore, without standardization as much effort 
might have been spent in the meantime on unstandardized procedures 
with less definitive results. 

The second argument, that standardization cripples novelty and 
imagination, does not necessarily hold. Suppose that one wishes to in- 
troduce a new-style memory drum, or to introduce a new material to be 
learned, after standardization has been achieved with an old-style drum, 
with other material. All that is necessary is to perform calibration experi- 
ments to show that the two drums give comparable results (if they do) 
with the original material. Then the new drum can be recommended if it 
is more convenient. The new materials then need to be standardized also, 
against the norms of the previously standardized material. Thus the new 
equipment and materials, if they merit it, can be added to the store of 
standardized materials, and normative data can be published where such 
data are accessible. 38 

Without standardization, failure to repeat earlier results commonly 
leads to the excuse that conditions may have differed. This excuse needs 
to be eliminated, so that we can repeat experimental data when we try 
to. Then we can go on to a new experiment, with specified modifications , 
and criticize the interpretation of the earlier data. Standardization thus 
need not prevent originality. The gain is that results under novel condi- 
tions can be compared confidently with results under standard condi- 
tions, so that the new results can fit into the historically accumulating 
body of scientific knowledge. 

Broadening the Boundary Conditions 

For some workers, the process of standardization, because it takes 
time and requires controls, is likely to result in a narrowing of the initial 
and boundary conditions of some varieties of experiment through specify- 
ing pretraining, apparatus dimensions, and the like. But again we must 
recognize a division of labor, so that the experimenter does not have to 
be coerced into doing what he does not find congenial. For some experi- 
menters, the most inviting task is to broaden the boundary conditions of 
both experiment and theory. This is, of course, a reason for encouraging 
naturalistic observations. But these observations will not serve well unless 
they are translated into experimentable form. For example, even now new 

38 The parallel with intelligence test standardization and revision is obvious. Such 
tests, because their norms are taken seriously, have been better “calibrated” than much 
laboratory apparatus- 
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richness is being introduced into learning experiments by including mo- 
tives formerly little studied in the laboratory. So many human motives 
are personal — emulative, defiant, aggressive, appealing — that a social 
psychology of learning is sure to develop eventually. 30 


RELATION BETWEEN LEARNING AND THE PRACTICAL 
DEMANDS OF INSTRUCTION AND TRAINING 

Theories serve more than one purpose: they attempt to organize 
existing knowledge, they attempt to provide guiding threads or hypothe- 
ses toward new knowledge, and they may also furnish principles by 
which what is known can be used. 40 This practical outcome is seldom cen- 
tral in the thinking of the constructor of theory, and it is not surprising, 
therefore, that the person seeking advice from the learning theorist often 
comes away disappointed. For example, Newman, writing on the con- 
temporary status of learning theory, says that for many practical skills 
such as those required in batting a home run, fencing, boxing, fancy div- 
ing, or dancing, "a rabbit's foot is worth as much as the psychologist's 
sage advice." 41 While Newman's skepticism here is extreme, he at least 
puts us on notice not to expect from the learning theorist too much that 
is practical. 

Some Agreement on Practical Matters 

It turns out, however, that many of the quarrels of the theorists are 
internal ones, not very important in relation to immediate practical prob- 
lems; there are, in fact, a great many practically important experimental 
relationships upon which the theorists are in substantial agreement. 
That is, they accept the facts as demonstrated, even though they dis- 
agree about the interpretations of these facts. If the theoretical differ- 
ences are irreconcilable, and one position eventually wins out over the 
other, there will ultimately be an effect upon practice. But advice for prac- 
tical people today need not wait for the resolution of these theoretical 
controversies. 

Here are a few statements upon which I would expect a majority 
of lear nin g theorists to agree. It would be too much to ask for perfect 
agreement, for some statements require many qualifications, and there are 
always a few theorists who are sticklers for wording. 

For a serious beginning, see Rotter (1954). 

40 Much of this section is adapted from Hilgard (1953b). 

41 Newman (1951), page 411. 
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1. In deciding who should learn what, the capacities of the learner 
are very important Brighter people can learn things less bright ones 
cannot learn; in general, older children can learn more readily than 
younger ones; the depline of ability with age, in the adult years, depends 
upon what it is that is being learned. 

2. A motivated learner acquires what he learns more readily than 
one who is not motivated. 42 The relevant motives include both general 
and specific ones, for example, desire to learn, need for achievement 
(general), desire for a certain reward or to avoid a threatened punish- 
ment (specific). 

3. Motivation that is too intense (especially pain, fear, anxiety) may 
be accompanied by distracting emotional states, so that excessive moti- 
vation may be less effective than moderate motivation for learning some 
kinds of tasks, especially those involving difficult discriminations. 

4. Learning under the control of reward is usually preferable to 
learning under the control of punishment. Correspondingly, learning 
motivated by success is preferable to learning motivated by failure. Even 
though the theoretical issue is still unresolved, the practical outcome must 
take into account the social by-products, which tend to be more favor- 
able under reward than under punishment. 

5. Learning under intrinsic motivation is preferable to learning un- 
der extrinsic motivation. 

6. Tolerance for failure is best taught through providing a backlog 
of success that compensates for experienced failure. 

7. Individuals need practice in setting realistic goals for themselves, 
goals neither so low as to elicit little effort nor so high as to foreordain 
to failure. Realistic goal-setting leads to more satisfactory improvement 
than unrealistic goal-setting. 

8. The personal history of the individual, for example, his reaction 
to authority, may hamper or enhance his ability to learn from a given 
teacher. 

9. Active participation by a learner is preferable to passive recep- 
tion when learning, for example, from a lecture or a motion picture. 

10. Meaningful materials and meaningful tasks are learned more 
readily than nonsense materials and more readily than tasks not under- 
stood by the learner. 

42 There may be theoretical argument whether motivation helps performance or 
habit strength, but this is an illustration where the practical outcome does not depend 
on the theoretical answer. Possibly the motivated learner merely practices more 
(Kimble, 1953); even so, he acquires what he needs to learn more readily than if he 
were not motivated, 
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11. There is no substitute for repetitive practice in the overlearning 
of skills (for instance, the performance of a concert pi anis t), or in the 
memorization of unrelated facts that have to be automatized. 

12. Information about the nature of a good performance, knowledge 
of his own mistakes, and knowledge of successful results, aid learning. 

13. Transfer to new tasks will be better if, in learning, the learner 
can discover relationships for himself, and if he has experience during 
learning of applying the principles within a variety of tasks. 

14. Spaced or distributed recalls are advantageous in fixing material 
that is to be long retained. 

These points are neither systematic nor comprehensive. They are 
listed merely in order to add concreteness to the suggestion that there 
are a number of useful generalizations about which students of learning 
are in substantial agreement. 

Improving the Contributions of Psychology to the Practical 

Understanding and Control of Learning 

If psychology is to be of greater help in gearing itself to problems of 
training/and to the other practical problems of learning, there are a few 
suggesjnons that can be offered. 

Validate the principles developed in the laboratory in real life 
situations. A theoretical principle cannot simply be borrowed and used 
to guide practice. It must be first tried out under the conditions of actual 
life. In real life, learning occurs in a social setting, with constraints of 
time arjj& personnel and equipment. An instructor teaching a course, be- 
cause le does not have complete charge of his students, is not free to 
do everything he would like to do. Hence a laboratory learning principle 
has tqbe tried out in the classroom before we can be sure that it is useful. 

.u. Be tolerant of negative results . If two methods of teaching or 
training are to be tried out experimentally, we ought to accept for trial 
only methods that have a good deal to commend them, for example, 
skilled teachers who favor them. Under these circumstances we are very 
likely to come out with differences that are not significant statistically, 
even though the experiment is well designed and the sample adequate. 
Instead of being disappointed with such a finding, and concealing it, the 
result is well worth knowing. Within the practical limitations of classroom 
teaching it is quite possible that alternative methods are equally ac- 
ceptable, and this may prove very comforting to teachers who prefer one 
or tne other method. 

Enlist the co-operation of inventive practical people . Theories in 
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the psychology of learning do not automatically design or invent teach- 
ing methods. Very often experienced teachers have a flair for finding 
ways to motivate students, to gain their participation, to suit learning 
tasks to their ability levels. The psychologist seeking to give practical 
help will seek to collaborate with such persons, using his own methods, 
to be sure, to validate their procedures, but capitalizing on their in- 
ventiveness and experience in a situation with many features not present 
in his laboratory. 

‘ /Lean more heavily on studies of problem-solving and creativity 
than on studies of rote learning . The old “principles of economy” in 
memorization and in acquiring skill depended very largely on studies of 
rote learning, and hence emphasized massed and distributed practice, 
learning by parts or by wholes, consequences of warm-up and fatigue, 
the effects of recitation, and so on. While these principles have some 
relevance for learning, they are not very applicable in practice. The 
boy learning to ride his bicycle does not need advice on distributing prac- 
tice. If he assembles a motor car, he doesn’t need to be taught to space 
his reviews in order to remember the name of the carburetor or generator. 

Of far more practical importance is the ability to make relevant use 
of past experience in facing new problems, of maintaining motivation un- 
til a difficult task is completed or a baffling problem solved, of gaining 
confidence in oneself as a creative person, of learning how to diagnose 
a problem, how to fill gaps in necessary knowledge, how to call on ex- 
perts when necessary. In a rapidly changing culture such as ours in 
America, learning how to go about learning in the face of new demands 
is more important than learning to commit serial passages to memory. 
Let me not be misunderstood: you cannot think without information. 
The point is that you have to learn how to use information in relation to 
presented problems. 

A closer integration of theory with practice is desirable, and it is 
hoped that an increasing number of psychologists will find this task at- 
tractive. 

Pure versus Applied Research 

The foregoing discussion raises some of the old controversies over 
pure and applied research. A study of the history of science will show 
that there is no uniform relationship between pure and applied science 
as they develop. The practice of extracting ores preceded the appropri- 
ate understanding of chemistry by centuries. Diets upon which people 
grew to maturity were chosen for ages before there was a science of 
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nutrition. As theoretical and experimental science mature, the relation- 
ship between science and invention becomes more intimate. Gunpowder 
could be invented before physical science was far advanced, but the 
atomic bomb could not have been made without calling upon the re- 
sources of pure science. The scientific study of learning is today in an 
intermediate position. While its results can be helpful to educators and 
parents, there are many problems in the practical guidance of learning 
which cannot yet be handled upon the basis of well-established principles. 

At any one time, scientists in experimental and theoretical fields 
can work only on the problems upon which they are prepared to work 
with the conceptual and material tools at hand. These may or may not 
be appropriate to the practical problem calling for solution. Mothers must 
feed and rear their children as best they can while the children are grow- 
ing up, and physicians have to treat their patients when they are ill. 
Practical pursuits cannot wait. Hence some sorts of adjustment are made 
in the practical world, while the scientists go about their business. Chil- 
dren may have rickets because at the time nothing is known about 
vitamins, and patients may die because a new curative drug has not yet 
emerged from the laboratory. Scientists could not be blamed if their 
science had not yet discovered vitamins or was not yet prepared to 
synthesize the new drug. 

Because scientists have to develop appropriate methods and con- 
cepts before their results can become efficient regulators of practice, their 
concerns for a time may appear to be remote from practical affairs, and 
some of their disputes will seem to be quibbles over distinctions that do 
not matter. All this suggests the need for patience and tolerance toward 
experimentation which pushes back the boundaries of the known and 
toward theory construction which attempts to sharpen the conceptual 
tools with which scientists can work. 

One faulty interpretation of the relationship between pure and 
applied science is to be avoided. This is the interpretation that applica- 
tions, if they are to have any verifiable basis, must wait until there is a 
pure science ready to be applied. A corollary of this interpretation is 
the confident hope that once the pure science is in order, applications 
will follow automatically. Neither the interpretation nor the corollary 
corresponds to reality, for it is quite possible to do applied research be- 
fore the problems of pure science are settled, and it is seldom if ever 
possible to apply scientific principles directly to practical situations with- 
out some empirical tailoring to make them fit. 

An adequate research program in the applied psychology of learn- 
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ing would rest in part upon the findings in the experimental studies of 
learning, but it would consist in much more than the making of sug- 
gestions on the basis of general principles. There must finally be experi- 
mental testing in the school, or on the playground, or in the shop — wher- 
ever the application is to be made. 

If in the end a principle has to be experimented upon at the point 
of application, it might be asked, why do laboratory research at all? If 
what we do with animals has to be repeated with children, why not 
work with children in the first place? There are several answers. For one 
thing, we do some things with animals which we cannot do with children. 
For another thing, we make many false starts in the laboratory, and it 
would be wasteful to subject school children to all the things we try 
in the laboratory. Again, in the laboratory we are able to control condi- 
tions as we are not able to control them in the school. A principle once 
discovered in a better controlled situation can be validated in a less-well- 
controlled one . 4S Even though the sole aim of learning experimentation 
were practical, it might prove economical to work in the laboratory in 
order to find the leads worth testing in practice. 

Research on practical problems, if well done, not only adjusts theory 
to practice, but contributes to substantiating, refuting, or extending 
theoretical knowledge. It is a mistake to distinguish too sharply between 
pure and applied research, if both are good research. It is true, however, 
that much applied research is necessarily concerned with problems bound 
to specific times and places, and so lacks the universality of pure re- 
search. It is these local and temporary features which lead to the judg- 
ment that applied research is inferior, and tend to reduce the prestige 
of the applied scientist relative to the pure scientist. When the relation- 
ships between pure and applied research are properly worked out, the 
requisite skills are of a high order however the labor is divided, and there 
is no reason why prestige should not be more equitably distributed. 
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Freud’s views on, 321-322 
functionalist views on, 350-362 
in gestalt theory, 252 
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Kessen), 423-424 

expectancy versus habit (Brunswik), 
177 
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on typical problems of learning, 113- 
115 



562 

Skinners views ( continued ) 
on verbal behavior, 107-108 
Woodworth’s views and, 338 
Smith’s (M. H., Jr.) spread-of-effect ex- 

S eriment, 35-36 

-learning theory (Rotter), 450-451 
Solution learning (Mowrer), 413 
Specificity doctrine (Thorndike), 43-45 
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Thorndike’s views and, 203, 204, 217 
typical problems of .learning and, 215- 
216 
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